National Center for
Education Statistics

The Nation's Report Card



U.S. Department of Education 

Institute of Education Sciences 
NCES 2005-464 



The Nation’s Report Card™ 

NAEP 2004 

Trends in Academic Progress 

Three Decades of Student Performance 
in Reading and Mathematics 





What is The Nation's Report Card™? 



The Nation's Report Card™, the National Assessment of
Educational Progress (NAEP), is a nationally representative
and continuing assessment of what America's students know
and can do in various subject areas. Since 1969, assessments 
have been conducted periodically in reading, mathematics, 
science, writing, history, geography, and other subjects. 

NAEP is a congressionally mandated project of the 
National Center for Education Statistics within the 
Institute of Education Sciences of the U.S. Department 
of Education. The Commissioner of Education Statistics 
is responsible, by law, for carrying out the NAEP project 
through competitive awards to qualified organizations. 

By making objective information on student performance 
available to policymakers at the national, state, and local 
levels, NAEP is an integral part of our nation's evaluation
of the condition and progress of education. Only information
related to academic achievement and relevant variables
is collected under this program. The privacy of individual
students and their families is protected, and the identities of
participating schools are not released.

In 1988, Congress established the National Assessment 
Governing Board (NAGB) to oversee and set policy for 
NAEP. The Board is responsible for selecting the subject 
areas to be assessed; setting appropriate student achievement 
levels; developing assessment objectives and test specifica- 
tions; developing a process for the review of the assessment; 
designing the assessment methodology; developing 
guidelines for reporting and disseminating NAEP results; 
developing standards and procedures for interstate, regional, 
and national comparisons; determining the appropriateness 
of all assessment items and ensuring the assessment items 
are free from bias and are secular, neutral, and nonideo- 
logical; taking actions to improve the form, content, use, 
and reporting of results of the National Assessment; and 
planning and executing the initial public release of NAEP 
reports. 



The National Assessment Governing Board 



Darvin M. Winick, Chair 

President 

Winick & Associates 
Dickinson, Texas 

Sheila M. Ford, Vice Chair 

Principal 

Horace Mann Elementary School 
Washington, D.C. 

Francie Alexander 

Chief Academic Officer, 
Scholastic, Inc. 

Senior Vice President, 

Scholastic Education 
New York, New York 

David J. Alukonis 

Chairman 

Hudson School Board 
Hudson, New Hampshire 

Amanda P. Avallone 

Assistant Principal and 
Eighth-Grade Teacher 
Summit Middle School 
Boulder, Colorado 

Honorable Jeb Bush 

Governor of Florida 
Tallahassee, Florida 

Barbara Byrd-Bennett 

Chief Executive Officer 
Cleveland Municipal School 
District 

Cleveland, Ohio 



Carl A. Cohn 

Clinical Professor 
Rossier School of Education 
University of Southern California 
Los Angeles, California 

Shirley V. Dickson 

Educational Consultant 
Laguna Niguel, California 

John Q. Easton 

Executive Director 
Consortium on Chicago School 
Reform 

Chicago, Illinois 

Honorable Dwight Evans 

Member 

Pennsylvania House of 
Representatives 
Philadelphia, Pennsylvania 

David W. Gordon 

Sacramento County Superintendent 
of Schools 

Sacramento County Office of 
Education 

Sacramento, California 

Henry L. Johnson 

Superintendent of 
Education 

Mississippi Department of 
Education 
Jackson, Mississippi 

Kathi M. King 

Twelfth-Grade Teacher 
Messalonskee High School 
Oakland, Maine 



Honorable Keith King 

Member 

Colorado House of 
Representatives 
Colorado Springs, Colorado 

Kim Kozbial-Hess 

Fourth-Grade Teacher 
Fall-Meyer Elementary School 
Toledo, Ohio 

Andrew C. Porter 

Professor 

Leadership Policy and 
Organizations 
Vanderbilt University 
Nashville, Tennessee 

Luis A. Ramos 

Community Relations Manager 
PPL Susquehanna 
Berwick, Pennsylvania 

Mark D. Reckase 

Professor 
Measurement and 
Quantitative Methods 
Michigan State University 
East Lansing, Michigan 

John H. Stevens 

Executive Director 
Texas Business and 
Education Coalition 
Austin, Texas 

Mary Frances Taymans, SND 

Executive Director 
National Catholic 
Educational Association 
Washington, D.C. 



Oscar A. Troncoso 

Principal 

Socorro High School 
Socorro Independent School 
District 
El Paso, Texas 

Honorable Thomas J. Vilsack 

Governor of Iowa 
Des Moines, Iowa 

Michael E. Ward

Former State Superintendent of 
Public Instruction 
North Carolina Public Schools 
Jackson, Mississippi 

Eileen L. Weiser 

Member, State Board of Education 
Michigan Department of 
Education 
Lansing, Michigan 

Grover J. Whitehurst (Ex officio) 

Director 

Institute of Education Sciences 
U.S. Department of Education 
Washington, D.C. 



Charles E. Smith 

Executive Director, 
NAGB 

Washington, D.C. 








NAEP 2004 Trends 
in Academic Progress 

Three Decades of Student Performance 
in Reading and Mathematics 






July 2005 



Marianne Perie 
Rebecca Moran 
Anthony D. Lutkus 

Educational Testing Service 

William Tirre 
Project Officer 

National Center for Education Statistics 



U.S. Department of Education 

Margaret Spellings 

Secretary 

Institute of Education Sciences 

Grover J. Whitehurst 

Director 

National Center for Education Statistics 

Grover J. Whitehurst 

Acting Commissioner 

The National Center for Education Statistics (NCES) is the primary federal entity for collecting, analyzing, and report- 
ing data related to education in the United States and other nations. It fulfills a congressional mandate to collect, collate, 
analyze, and report full and complete statistics on the condition of education in the United States; conduct and publish 
reports and specialized analyses of the meaning and significance of such statistics; assist state and local education agencies in 
improving their statistical systems; and review and report on education activities in other countries. 

NCES activities are designed to address high priority education data needs; provide consistent, reliable, complete, 
and accurate indicators of education status and trends; and report timely, useful, and high quality data to the U.S. 
Department of Education, the Congress, the states, other education policymakers, practitioners, data users, and the 
general public. 

We strive to make our products available in a variety of formats and in language that is appropriate to a variety of 
audiences. You, as our customer, are the best judge of our success in communicating information effectively. If you have 
any comments or suggestions about this or any other NCES product or report, we would like to hear from you. Please 
direct your comments to: 

National Center for Education Statistics 
Institute of Education Sciences 
U.S. Department of Education 
1990 K Street NW 
Washington, DC 20006-5651 

July 2005 

The NCES World Wide Web Home Page address is http://nces.ed.gov 

The NCES World Wide Web Electronic Catalog address is http://nces.ed.gov/pubsearch

SUGGESTED CITATION 

Perie, M., Moran, R., and Lutkus, A.D. (2005). NAEP 2004 Trends in Academic Progress: Three Decades of Student 
Performance in Reading and Mathematics (NCES 2005-464). U.S. Department of Education, Institute of Education 
Sciences, National Center for Education Statistics. Washington, DC: Government Printing Office. 

FOR ORDERING INFORMATION ON THIS REPORT, WRITE: 

U.S. Department of Education, ED Pubs, P.O. Box 1398, Jessup, MD 20794-1398, or call toll free 1-877-4ED-Pubs;
or order online at http://www.edpubs.org 

CONTENT CONTACT: 

William Tirre 
202-502-7361 
William.Tirre@ed.gov 



The work upon which this publication is based was performed for the National Center for Education Statistics 
by Educational Testing Service, the Education Statistics Services Institute, Pearson Educational Measurement, and Westat. 



NAEP 2004 TRENDS IN ACADEMIC PROGRESS 



Since its inception in 1969, 
NAEP has tracked trends in 
student performance over time. 



Executive Summary 

The citizens and leaders of the United States have long valued education 
as a foundation for democracy, a resource for economic prosperity, and a 
means of realizing personal goals and individual potential. Throughout the 
nation's history, the commitment to educate children has grown stronger
and more inclusive, and in recent decades, so has the expectation that our
nation's schools and teachers be accountable (Ravitch 2002). In 2002, the
reauthorization of the Elementary and Secondary Education Act, also
known as the No Child Left Behind (NCLB) Act, further strengthened
that commitment and expectation.

Since its inception in 1969, the National Assessment of Educational 
Progress (NAEP) has served the important function of measuring our 
nation's educational progress by regularly administering various
subject-area assessments to nationally representative samples of students. One of
the primary objectives of NAEP is to track trends in student performance 
over time. This report presents the results of NAEP long-term trend assess- 
ments in reading and mathematics, which were most recently administered 
in 2004 to students ages 9, 13, and 17. Because the assessments have been 
administered at different times in the 35-year history of NAEP, they make 
it possible to chart educational progress since 1971 in reading and 1973 in 
mathematics. Prior to 2004, the most recent long-term trend assessment 
was given in 1999, when results were reported for reading, mathematics, 
and science. 

It should be noted that these long-term trend assessments are different 
from more recently developed assessments in the same subjects that make 
up the “main NAEP” assessment program. Because the instruments and 
methodologies of the two assessment programs are different, comparisons 
between the long-term trend results presented in this report and the main 
assessment results presented in other NAEP reports are not possible. 

Approximately 38,000 students participated in the reading assessment, 
and 37,000 participated in the mathematics assessment. Appendix A pro- 
vides technical information on this study, including sample sizes and a 
description of the significance tests done on each set of results. Only dif- 
ferences that have been determined to be statistically significant at the 0.05 
level after controlling for multiple comparisons are included in this report. 
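This summary does not say how the control for multiple comparisons is carried out; Appendix A describes the procedure actually used. Purely as an illustration, the sketch below assumes a Benjamini-Hochberg false discovery rate adjustment at the 0.05 level. The function name and the p-values are hypothetical and are not taken from NAEP data.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Flag which tests stay significant under a Benjamini-Hochberg adjustment.

    Assumption: BH is used here only as one common way to control for
    multiple comparisons; it is not confirmed by this summary.
    """
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k whose p-value is at or below alpha * k / m.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= alpha * rank / m:
            cutoff_rank = rank

    # Every test ranked at or below the cutoff is reported as significant.
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            significant[idx] = True
    return significant


# Hypothetical p-values for four year-to-year comparisons.
example_flags = benjamini_hochberg([0.001, 0.04, 0.03, 0.20])
```

With these made-up inputs, only the first comparison would be reported, even though three of the four p-values fall below an unadjusted 0.05 threshold.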






National Results 

National results, provided in chapter 2, are described in
three ways: average score, score at selected percentiles,
and percentage of students performing at or above each
performance level. Student performance in each subject
area is summarized as an average score on a 0-500
scale. The five long-term trend performance levels
presented in this report were set at 50-point intervals on
the two subject-area scales to provide a verbal description
of student performance at different points on the
scale. All national findings are reported from 1971 to
2004 for reading and from 1973 to 2004 for mathematics. The
primary findings include the following:

Average Scores 

► Between 1999 and 2004, average reading scores 
increased at age 9 and average mathematics scores 
increased at ages 9 and 13. No measurable changes 
in average scores were found at age 17 in either sub- 
ject between 1999 and 2004. 

► In reading, 9-year-olds scored higher in 2004 than 
in any previous assessment year, with an increase of 
7 points between 1999 and 2004. Average scores for 
age 13 showed no measurable differences between 
assessment years 1999 and 2004, but still were high- 
er in 2004 than the scores in 1971 and 1975. For 
age 17, the average score in 2004 was not measurably 
different from the average score in the first assess- 
ment year, 1971. 

► The average score in mathematics at age 9 was higher 
in 2004 than in any previous year — 9 points higher 
than in 1999. The average score for 13-year-olds 
increased between 1999 and 2004 by 5 points. The 
average score at age 17 was not measurably different 
from 1973 or 1999. 



Percentiles 

► The reading score of 9-year-olds at the median (50th 
percentile) was higher in 2004 than the median score 
in every other year. 

► Overall gains in reading scores for 13-year-olds were 
evident among higher performing students — those 
scoring at the 75th and 90th percentiles — between 
1971 and 2004. 

► Seventeen-year-olds showed no measurable improve- 
ments in reading scores at any of the selected 
percentiles between 1999 and 2004 or between 1971 
and 2004. 

► Mathematics scores for 9-year-olds at each of the 
selected percentiles showed gains between 1978 and 
2004, increasing 26 points at the 10th percentile, 23 
points at the 50th percentile, and 18 points at the 
90th percentile. 

► The mathematics score for 13-year-olds at each of 
the five percentile levels was higher in 2004 than in 
every previous assessment year, except at the 10th 
percentile. 

► Mathematics scores for 17-year-olds in 2004 showed 
no measurable change since 1992 at any of the five 
percentiles. 

Performance Levels 

► The partially developed skills and understanding 
associated with reading at level 200 were demonstrat- 
ed by 70 percent of 9-year-olds in 2004, more than 
in any other assessment year except 1980; by 94 per- 
cent of 13-year-olds; and by almost all 17-year-olds. 

► The percentages of 13-year-olds and 17-year-olds 
who demonstrated the ability to interrelate ideas and 
make generalizations in reading (level 250) were 61 
percent and 80 percent, respectively, in 2004, not 
measurably different from those in 1971 and 1999. 

► Reading performance at or above level 300 — 
understanding complicated information — was 
demonstrated by 38 percent of 17-year-olds in 2004, 
down from 41 percent a decade earlier in 1994. 






► The beginning skills and understandings character- 
istic of level 200 in mathematics were demonstrated 
by 89 percent of 9-year-olds in 2004, more than in 
any other assessment year. Approximately 99 percent 
of 13-year-olds also demonstrated at least this level 
of performance in 2004. 

► At age 13, the percentages of students at level 300 in 
mathematics increased from 17 percent in 1990 to 
23 percent in 1999 and then to 29 percent in 2004. 
Students at this level could perform moderately com- 
plex procedures and use logical reasoning to solve 
problems. In 2004, 39 percent of 17-year-olds were 
at or above level 300 in mathematics, an increase of 
7 percentage points from 1978. 

► Across the assessment years in mathematics, between 
5 and 8 percent of 17-year-olds performed at level 
350, the highest performance level, in which stu- 
dents applied a range of reasoning skills to solve mul- 
tistep problems. 
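The "percentage at or above each performance level" statistic used in the findings above can be sketched as follows. The cut points 150 through 350 are inferred from the text (five levels, 50 points apart, with 350 the highest). The function name, the optional weights argument, and the sample scores are hypothetical; the report's actual estimates come from weighted, scaled data as described in Appendix A.

```python
# Assumed cut points: five levels at 50-point intervals, topping out at 350.
PERFORMANCE_LEVELS = (150, 200, 250, 300, 350)


def percent_at_or_above(scores, weights=None):
    """Return {level: percentage of students scoring at or above that level}.

    `weights` stands in for sampling weights (an assumption); when omitted,
    every student counts equally.
    """
    if weights is None:
        weights = [1.0] * len(scores)
    total = sum(weights)
    return {
        level: 100.0 * sum(w for s, w in zip(scores, weights) if s >= level) / total
        for level in PERFORMANCE_LEVELS
    }


# Hypothetical scores on the 0-500 scale, not NAEP data.
example = percent_at_or_above([140, 210, 260, 310, 360])
```

Because the levels are cumulative, a student counted at level 300 is also counted at levels 150, 200, and 250, so the percentages can only stay level or fall as the level rises.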

Student Group Results 

Chapter 3 describes the average scores for various
groups of students, including male and female students;
White, Black, and Hispanic students; and students grouped by
student-reported levels of parents’ education, which included
less than high school, graduated from high school,
some education after high school, and graduated from
college. Some of the results were as follows:

Gender 

► At all three ages in 2004, female students had higher 
average reading scores than their male counterparts. 

► In 2004, there was no measurable difference between 
the average mathematics scores of male and female 
students at age 9, but at ages 13 and 17, male stu- 
dents scored higher on average than female students. 

► The gender gap for 9-year-olds’ reading scores in
2004 was smaller than the gaps in the first three
assessment years and 1996. This gap did not change
measurably between 2004 and any previous assessment
year for 13-year-olds. This score gap in 2004
showed no measurable difference for 17-year-olds
from the gap in 1999 or 1971.



Race/Ethnicity 

► White students had higher average reading scores in 
2004 than in 1971 at ages 9 and 13. 

► For Black students at all three ages, average reading 
scores in 2004 were higher than in 1971. 

► Although White students continue to outscore 
Black students, the White-Black score gap in read- 
ing narrowed from 1971 to 2004 at all three ages. 
The White-Black reading score gap for 9-year-olds 
decreased from 35 points in 1999 to 26 points in 
2004. 

► For Hispanic students, the average reading score at 
age 9 was higher in 2004 than in any other assess- 
ment year. Their average score at age 13 was higher 
in 2004 than in 1975, but not measurably different 
from that in 1999. No measurable difference was 
found between the average score for Hispanic stu- 
dents at age 17 in 2004 and that in 1999. 

► Although White students continue to outscore
Hispanic students, the White-Hispanic reading score
gap for students at age 9 in 2004 was smaller than
it was in 1994, 1984, 1980, and 1975. The
White-Hispanic reading score gap for 13-year-olds showed
no measurable difference between 2004 and 1999 or
1975. The score gap between White and Hispanic
students at age 17 was measurably smaller in 2004
than in 1975.

► White students at all three ages scored higher, on 
average, in 2004 than in 1973 in mathematics. 

► The average mathematics scores for Black students 
were higher in 2004 than in 1973 at all three ages. 
Average scores for Black students at ages 9 and 13 
were higher in 2004 than in any previous assessment 
year. 

► The differences in average scores for White and 
Black students at all ages decreased between the first 
(1973) and the most recent (2004) assessment in 
mathematics, although White students continued to 
outscore Black students in 2004. During this same 
period, the White-Black score gaps in mathematics 
narrowed by 12, 19, and 12 points for ages 9, 13, 
and 17, respectively. 






► Hispanic students’ performance in mathematics was 
higher at all three ages in 2004 than in any assess- 
ment year from 1973 through 1982. Average scores 
for Hispanic students at ages 9 and 13 were higher in 
2004 than in any previous assessment year. 

► White students scored higher on average than 
Hispanic students at all three age levels in 2004. For 
ages 13 and 17, the White-Hispanic score gap was 
smaller in 2004 than in 1973, but for age 9 there 
was no measurable difference in the size of the score 
gap between the first (1973) and most recent (2004) 
assessment year. 
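The score-gap comparisons in this section come down to simple differences between group averages. A minimal sketch, using the White-Black reading gap for 9-year-olds quoted above (35 points in 1999, 26 points in 2004); the function names are hypothetical. The arithmetic does not reproduce the significance testing the report applies before calling a change measurable, and published gaps are computed from unrounded averages, so differences of rounded figures may be off by a point.

```python
def score_gap(average_a, average_b):
    """Gap between two group averages, in scale-score points."""
    return average_a - average_b


def gap_change(gap_earlier, gap_later):
    """Change in a gap between two years; negative means the gap narrowed."""
    return gap_later - gap_earlier


# Gap figures quoted in the summary for reading at age 9 (1999 and 2004).
reading_age9_change = gap_change(35, 26)
narrowed = reading_age9_change < 0
```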

Parents' Education 

► As of 2004, the percentage of students reporting that at
least one parent graduated from college had increased
since 1980 for reading and 1978 for mathematics,
while the percentage of students reporting that the
highest level of education for their parents was a high
school diploma or less had decreased.

► At age 13, there have been no measurable changes in 
average reading scores between 2004 and any previ- 
ous assessment year regardless of the level of parents’ 
education reported by the student. 

► The average reading score for 17-year-olds who 
indicated that at least one parent had some educa- 
tion after high school was lower in 2004 than in any 
previous assessment year. For 17-year-olds who indi- 
cated that at least one parent graduated from college, 
the average score in 2004 (298) was lower than the 
average scores in 1990 (302) and 1984 (302). 

► Students who reported that their parents had less 
than a high school education showed no measurable 
change in average mathematics score between 1999 
and 2004 at either age 13 or 17, but their 2004 
scores were higher than those in 1978. 



► For students whose parents’ highest education level 
was high school graduation or some education after 
high school, the average mathematics score at age 
13 was higher in 2004 than in any other assess- 
ment year, while at age 17 there were no measurable 
changes between 1978 and 2004. 

► For students with at least one parent who gradu- 
ated from college, the average mathematics score in 
2004 was higher than in any other assessment year at 
age 13; no measurable difference was seen at age 17 
between 1978 and 2004. 

Contextual Variables 

As described in chapter 4, examining student scores in 
the context of their learning and home environments 
provides useful information. Learning and home factors 
for which trends are reported include students’ reports 
of how often they read for fun, completed homework, 
used computers, and watched television, and the 
advanced mathematics courses they had taken. Some of 
the findings include the following: 

Homework. Students who took the reading assessment 
were asked how many hours they had spent on home- 
work the previous day. 

► The percentage of students at age 9 indicating that
no homework was assigned or that they did not do
any homework decreased between 1984 and 2004.
In 2004, a greater percentage of 9-year-olds indicated
that they spent less than 1 hour on homework than
in any other year in which the question was asked.

► In 2004, the average reading score of 9-year-olds 
who spent less than 1 hour on homework was higher 
than the average reading scores of students who did 
not do the homework that was assigned or who 
spent more than 2 hours on homework. 

► At age 13, the percentage of students spending less 
than 1 hour on homework increased from 36 percent 
in 1984 to 40 percent in 2004. At the same time, 
the percentage of students spending 1 to 2 hours on 
homework decreased from 29 percent in 1984 to 26 
percent in 2004. 






► At age 13, students who spent 1 to 2 hours or 2 or 
more hours on homework had higher average read- 
ing scores than their peers who spent less than 1 
hour on homework, did not do their homework, or 
did not have any homework to do. 

► At age 17, the percentage of students reporting that
they were not assigned homework increased from 22
to 26 percent between 1984 and 2004. Over the same
period, the percentage of 17-year-olds indicating they
had spent 1 to 2 hours on homework the previous day
decreased from 27 to 22 percent.

► At age 17, students who spent 2 or more hours on 
homework had higher average reading scores in 2004 
than those who spent 1 to 2 hours, whose scores 
were higher than those who spent less than 1 hour, 
whose scores in turn were higher than those who did 
not do any homework. 

Reading for Fun. Students who took the reading
assessment were asked to estimate how often they read
for fun.

► There were no measurable changes between 1984 
and 2004 in the percentage of 9-year-olds indicat- 
ing that they read for fun almost every day. At ages 
13 and 17, the percentage saying they read for fun 
almost every day was lower in 2004 than in 1984. 
This trend was accompanied by an increase over the 
same 20-year time period in the percentage indicat- 
ing that they never or hardly ever read for fun. 

► At all three ages, students who indicated that they 
read for fun almost every day had higher average 
reading scores in 2004 than those who said that they 
never or hardly ever read for fun. Students at all 
three age levels who said that they read for fun once 
or twice a week had higher average scores than those 
who never or hardly ever read for fun. 



Computer Access and Usage. Students at ages 13 and 17 
who took the mathematics assessment were asked three 
questions about their access to computers and how they 
used them. 

► The percentage of 13-year-olds with access to com- 
puters in schools increased from 12 percent in 1978 
to 37 percent in 2004. The percentage of students 
receiving instruction in computers at age 13 also 
increased, from 14 percent in 1978 to 48 percent 
in 2004. In the 2004 assessment, 69 percent of 
13-year-olds said that they had used a computer to 
solve a mathematical problem. 

► Similar increases were also seen among 17-year- 
olds, where the percentage of students with access 
to a computer in school increased by 33 percentage 
points between 1978 and 2004. The percentage of 
17-year-olds using a computer to solve mathemat- 
ics problems increased from 46 percent in 1978 to 
66 percent in 1999, then to 70 percent in 2004. In 
that year, 36 percent reported that they had studied 
mathematics using computers. 

► There were no measurable differences in mathemat- 
ics scores between 13-year-olds who responded 
positively and those who responded negatively to any 
of the computer access and usage questions in 2004. 
At age 17, students who indicated that they had 
access to a computer at school scored 5 points higher 
in 2004 than students who did not have such access. 

► In 2004, students at age 17 who reported that they
had used a computer to solve a mathematical problem
scored 6 points higher on average than students
who had not used a computer for that purpose.
There was no measurable difference in average
mathematics scores for 17-year-olds based on whether or
not they had studied mathematics using computers.






Course-Taking Patterns in Mathematics. Students at age 
17 who took the mathematics assessment were asked 
to check all the mathematics courses they had taken or 
were currently taking. The highest course checked was 
used for the analyses. 

► A greater percentage of 17-year-olds indicated they 
were taking or had taken calculus in 2004 than in 
any previous assessment year. The percentage tak- 
ing second-year algebra increased from 37 percent 
in 1978 to 53 percent in 2004, while the percentage 
of students who indicated that the highest level of 
mathematics they had taken by age 17 was pre- 
algebra or algebra was lower in 2004 than in 1978. 

► The trend towards higher-level course-taking was 
seen across all three racial/ethnic groups shown. The 
percentage of White, Black, and Hispanic students 
who indicated that their highest course was second- 
year algebra was higher in 2004 than in 1978. In 
2004, a higher percentage of White students took 
calculus (19 percent) compared to Black students at 
the same age (8 percent). At 14 percent, the percent- 
age of Hispanic students taking calculus was not 
measurably different from the percentage of either 
White or Black students in 2004. 



2004 Bridge Study 

Several changes were made to the long-term trend 
assessment in 2004 to align it with current assess- 
ment practices and policies applicable to the NAEP 
main assessments. These changes, discussed in detail in 
chapter 5, included replacing items that had outdated 
material, eliminating blocks of items for subjects no 
longer reported, replacing background questions, and 
changing some administration procedures. In addi- 
tion, the 2004 modified assessment provided for the 
inclusion of and accommodations for students with dis- 
abilities and English language learners. 

A bridge study was conducted to ensure that the 
interpretation of the assessment results remains con- 
stant over time. A bridge study involves administering 
two assessments: one that replicates the assessment 
given in the previous assessment year (a bridge assess- 
ment), and one that represents the new design (a 
modified assessment). In 2003-2004, students were 
randomly assigned to take either the bridge assessment 
or the modified assessment. The bridge assessment rep- 
licated the instrument given in 1999 and used the same 
administration techniques. The modified assessment 
included the new items and features discussed above. 
This modified assessment will provide the basis of com- 
parison for all future assessments, and the bridge study 
will link its results back to the results of the past 33 
years. The results from the bridge study are presented 
in chapters 2 and 4, and comparisons between the two 
assessments are provided in chapter 5. 

► Comparing the results of the modified and bridge 
assessments demonstrates that the link between 
the 2004 bridge and modified assessments was 
successful. 
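The random assignment at the heart of the bridge study can be sketched as below. This is a simplified illustration, assuming an even split of a flat list of students; the actual assignment operated within the NAEP sample design described in Appendix A. The function name, the seed, and the student identifiers are hypothetical.

```python
import random


def assign_forms(student_ids, seed=0):
    """Randomly split students between the bridge and modified assessments.

    Assumption: a simple 50/50 split over one list of students, used only
    to illustrate the idea of random assignment.
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(student_ids)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"bridge": shuffled[:half], "modified": shuffled[half:]}


# Ten hypothetical students, identified by number.
groups = assign_forms(range(10), seed=2004)
```

Because assignment is random, the two groups should be comparable on average, so a difference between bridge and modified results can be attributed to the assessment changes rather than to the students who happened to take each version.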



Acknowledgments 






The authors wish to thank all of those who contributed to the design, writing, production, and review of this 
report for their thoughtful critique, careful fact checking, insightful suggestions, and creativity. In particular, the 
authors acknowledge the contributions of Steve Gorman, James Griffith, Lisa Hudson, Andrew Kolstad, Taslima 
Rahman, and Marilyn Seastrom of the National Center for Education Statistics (NCES); Mary Crovo, Larry 
Feinberg, Ray Fields, Susan Loomis, and Sharif Shakrani of the National Assessment Governing Board; and Tajuana 
Bates, Yang Chin, Kim Gettis, Dana Kelly, and Alan Vanneman of the Education Statistics Services Institute. 
Outside reviewers included Lisa Clarke and Nancy Horkay. 

This report would not have been possible without the ETS team of researchers and data analysts who designed 
the administration and analyzed the results to produce the data included in this report. These ETS staff include 
Andreas Oranje, Brenda Tay-Lim, Amy Dresher, Lydia Gladkova, and Catherine McClellan from the research 
group; and Tatyana Petrovicheva, Yuxin Tang, David Freund, Laura Jerry, Matthew Kandathil, Edward Kulick, 
Youn-Hee Lim, Mei-Jang Lin, and Haiyang Liu from the data analysis group. Gloria Dion, David Garber, and 
Patricia Donahue prepared the items for release. Marilyn Binkley from NCES also worked on the assessment 
development and design of the long-term trend. 

Many people at ETS were responsible for the careful edit and review of these chapters. In particular the authors 
wish to thank our advisors, Nancy Mead and Wendy Grigg; our editors, Arlene Weiner, Mary Daane, and Linda 
Myers; and our quality control staff, Ming Kuang, Carmen Payton, Janice Goodis, and Chrystal Murphy. Finally, 
we wish to thank the ETS production staff, Loretta Casalaina, Rick Hasney, and Susan Mills, for transforming our 
final chapters into a crisp and attractive report, and our special staff assistant, Karen Damiano, who helped ensure 
that the final details were completed. 






CONTENTS 



Contents 



Executive Summary iii 

Acknowledgments ix 

Chapter 1 Introduction 1 

NAEP Assessments 1 

Overview of the 2004 Long-Term Trend Assessments 2 

2004 Bridge Study 3 

Content of the Assessments 4 

The Long-Term Trend Background Questionnaires 4 

The Student Sample 4 

Reporting the Trend Results 5 

About This Report 5 

Cautions in Interpreting the Long-Term Trend Results 6 

Chapter 2 National Trends in Academic Achievement 7 

How the Results Are Presented 8 

National Trends in Reading Performance 9 

Reading Perform a nee- Level Descriptions 13 

National Trends in Mathematics Performance 16 

Mathematics Perform a nee- Level Descriptions 21 

Summary 24 

Chapter 3 Trends in Academic Achievement Among Student Groups .... 27 

Description of Student Groups 28 

Trends in Reading Scores by Student Groups 29 

Trends in Mathematics Scores by Student Groups 38 

Summary 47 

Chapter 4 Trends in Students’ School and Home Experiences 49 

Contextual Factors Associated With Reading 50 

Contextual Factors Associated With Mathematics 56 

Summary 65 

Chapter 5 Comparison of Bridge and Modified Assessments 67 

Specific Changes Made for the 2004 Long-Term Trend Assessment. ... 68 

Bridge Study 70 

Comparison of Bridge and Modified Results for Reading 71 

Comparison of Bridge and Modified Results for Mathematics 74 

Summary 76 






Chapter 6 Sample Questions 77 

Reading: 9-Year-Olds 78 

Reading: 13-Year-Olds 80 

Reading: 17-Year-Olds 82 

Mathematics: 9-Year-Olds 85 

Mathematics: 13-Year-Olds 86 

Mathematics: 17-Year-Olds 87 

References 89 

Appendix A Overview of Procedures Used in the 

2004 NAEP Long-Term Trend Assessments 91 

The Reading Assessment 93 

The Mathematics Assessment 95 

Sampling and Data Collection 96 

Student Exclusion Rates 103 

Data Collection and Scoring 104 

Weighting 106 

Data Analysis and IRT Scaling 107 

Setting the Performance Levels 111

NAEP Reporting Groups 112 

Estimating Variability 113 

Drawing Inferences from the Results 114 

Analyzing Group Differences in Averages and Percentages 115 

Conducting Multiple Tests 115 

Cautions in Interpretations 117 

Appendix B Percentage Distribution of Students Taking 

Each Assessment in 2004 Across Various Student Groups 119 

Appendix C Glossary of Terms 123 






List of Tables 



Table 5-1. 

Total number of questions of each format administered in the bridge 

and modified reading assessments, by age: 2004 69 

Table 5-2. 

Total number of questions of each format administered in the bridge 

and modified mathematics assessments, by age: 2004 69 

Table A-1.

Target student sample size in reading and mathematics, 

by type of school and type of assessment: 2004 97 



Table A-2. 

Number of schools and estimated number of students within 
the sampled primary sampling units (PSUs) for public schools, 



by NAEP region and metropolitan status: 2004 99 

Table A-3. 

Number of schools and estimated number of students within 

the sampled primary sampling units (PSUs), by private school 

affiliation: 2004 99 

Table A-4. 

Student sample sizes for the reading long-term trend scaling: 

1971-2004 101 

Table A-5. 

School and student participation rates for the reading long-term trend 
assessments: 1971-2004 101 

Table A-6. 

Student sample sizes for the mathematics long-term trend 

scaling: 1978-2004 102 

Table A-7. 

School and student participation rates for the mathematics 

long-term trend assessments: 1973-2004 102 

Table A-8. 

Student exclusion rates for the reading and mathematics 

long-term trend assessments: 1990-2004 103 

Table A-9. 

Percentage exact agreement between readers for the reading 

long-term trend assessment scoring: 2004 105 






Table A-10. 

Percentage exact agreement between readers for the mathematics 
long-term trend assessment scoring: 2004 105 

Table A-11.

Summary item response rates for the reading long-term trend 

assessment, by different types of response: 2004 108 

Table A-12. 

Summary item response rates for the mathematics long-term trend 
assessment, by different types of response: 2004 108 

Table A-13. 

Trends in reading and mathematics average scale scores for students 

ages 9, 13, and 17: 1971-2004 114 

Table A-14.

Example of False Discovery Rate comparisons of average scale scores 

for different groups of students 116 

Table B-1.

Percentage of students assessed in reading at ages 9, 13, and 17, 



by type of assessment and student and school characteristics: 2004 120

Table B-2. 

Percentage of students assessed in mathematics at ages 9, 13, and 17, 
by type of assessment and student and school characteristics: 2004 120



Table B-3. 

Percentage of students assessed in reading at ages 9, 13, and 17, 
by student and school characteristics: 1971, 1999, and 2004 121 

Table B-4. 

Percentage of students assessed in mathematics at ages 9, 13, and 17, 
by student and school characteristics: 1978, 1999, and 2004 121 






List of Figures 



Figure 1-1. 

Comparison of the old and new long-term trend assessment 

Figure 2-1. 

Trends in average reading scale scores for students ages 9, 13, and 17: 
1971-2004 10 

Figure 2-2. 

Trends in reading scale score at selected percentiles for students 

ages 9, 13, and 17: 1971-2004 11 

Figure 2-3. 

Trends in percentages at or above reading performance levels 

for students ages 9, 13, and 17: 1971-2004 14 

Figure 2-4. 

Trends in average mathematics scale scores for students 

ages 9, 13, and 17: 1973-2004 17 

Figure 2-5. 

Trends in mathematics scale score at selected percentiles for students 
ages 9, 13, and 17: 1978-2004 19 

Figure 2-6. 

Trends in percentages at or above mathematics performance levels 

for students ages 9, 13, and 17: 1978-2004 22 

Figure 2-7. 

Summary of trends in reading and mathematics average scale scores 

for students ages 9, 13, and 17: 1971-2004 24 

Figure 2-8. 

Summary of trends in reading and mathematics scale score percentiles 
for students ages 9, 13, and 17: 1971-2004 24 

Figure 2-9. 

Summary of trends in reading and mathematics percentages at or above 
performance levels for students ages 9, 13, and 17: 1971-2004 25 

Figure 3-1. 

Trends in average reading scale scores and score gaps for students 

ages 9, 13, and 17, by gender: 1971-2004 30 

Figure 3-2. 

Trends in average reading scale scores and score gaps for White students 
and Black students ages 9, 13, and 17: 1971-2004 32 

Figure 3-3. 

Trends in average reading scale scores and score gaps for White students 
and Hispanic students ages 9, 13, and 17: 1971-2004 34 

Figure 3-4. 

Trends in average reading scale scores for students ages 13 and 17, 



by student-reported parents’ highest level of education: 1980-2004 37






Figure 3-5. 

Trends in average mathematics scale scores and score gaps for students 



ages 9, 13, and 17, by gender: 1973-2004 39 

Figure 3-6. 

Trends in average mathematics scale scores and score gaps for White 
students and Black students ages 9, 13, and 17: 1973-2004 41 

Figure 3-7. 

Trends in average mathematics scale scores and score gaps for White 
students and Hispanic students ages 9, 13, and 17: 1973-2004 43 

Figure 3-8. 



Trends in average mathematics scale scores for students ages 13 and 17, 
by student-reported parents’ highest level of education: 1978-2004 45



Figure 3-9. 

Summary of trends in reading and mathematics average scale scores 

for students ages 9, 13, and 17, by gender: 1971-2004 47 

Figure 3-10. 

Summary of trends in reading and mathematics average scale scores 

for students ages 9, 13, and 17, by race/ethnicity: 1971-2004 48 

Figure 3-11. 

Summary of trends in reading and mathematics average scale scores 

for students ages 13 and 17, by student-reported parents’ highest level

of education: 1978-2004 48 

Figure 4-1. 

Average reading scale scores for students ages 9, 13, and 17, 

by amount of time spent on homework: 2004 50 

Figure 4-2. 

Percentages of students ages 9, 13, and 17, by amount of time spent 
on homework: 1980, 1984, 1999, and 2004 51 

Figure 4-3. 

Average reading scale scores for students ages 9, 13, and 17, 

by pages read per day in school and for homework: 2004 52 

Figure 4-4. 

Percentages of students ages 9, 13, and 17, by pages read per day 

in school and for homework: 1984, 1999, and 2004 53 

Figure 4-5. 

Average reading scale scores for students ages 9, 13, and 17, 
by frequency of reading for fun: 2004 55 

Figure 4-6. 

Percentages of students ages 9, 13, and 17, by frequency of reading
for fun: 1984, 1999, 2004 55






Figure 4-7. 

Average mathematics scale scores for students age 13, 

by type of mathematics course: 2004 56 

Figure 4-8. 

Percentage of students age 13, by type of mathematics course: 

1986, 1999, and 2004 56

Figure 4-9. 

Average mathematics scale scores for students age 17, 

by highest mathematics course taken: 2004 57 

Figure 4-10. 

Percentage of students age 17, by highest mathematics course taken: 

1978, 1999, and 2004 58

Figure 4-11. 

Percentage of students age 17, by gender and highest mathematics 

course taken: 1978, 1999, and 2004 58 

Figure 4-12. 

Percentage of students age 17, by race/ethnicity and highest mathematics 
course taken: 1978, 1999, and 2004 59 

Figure 4-13. 

Average mathematics scale scores for students ages 13 and 17, 

by access to and use of computers for mathematics: 2004 61 

Figure 4-14. 

Percentages of students ages 13 and 17, by availability and use 
of computers: 1978, 1999, and 2004 62 

Figure 4-15. 

Average mathematics scale scores for students age 17, 

by frequency of doing mathematics homework: 2004 63 

Figure 4-16. 

Percentage of students age 17, by frequency of doing mathematics 
homework: 1978, 1999, 2004 63 

Figure 4-17. 

Average mathematics scale scores for students ages 9, 13, and 17, 
by amount of daily television watching: 2004 64 

Figure 4-18. 

Percentages of students ages 9, 13, and 17, by amount of daily 
television watching: 1978, 1982, 1999, and 2004 65






Figure 5-1. 

Average reading scale scores for students ages 9, 13, and 17 for bridge 
and modified assessments: 2004 71 

Figure 5-2. 

Average reading scale scores for students ages 9, 13, and 17 for bridge 
and modified assessments, by gender: 2004 72 

Figure 5-3. 

Average reading scale scores for students ages 9, 13, and 17 for bridge 
and modified assessments, by race/ethnicity: 2004 73 

Figure 5-4. 

Average mathematics scale scores for students ages 9, 13, and 17 

for bridge and modified assessments: 2004 74 

Figure 5-5. 

Average mathematics scale scores for students ages 9, 13, and 17 

for bridge and modified assessments, by gender: 2004 75 

Figure 5-6. 

Average mathematics scale scores for students ages 9, 13, and 17 

for bridge and modified assessments, by race/ethnicity: 2004 76 

Figure A-1.

Changes to the 1999 reading long-term trend assessment booklets 
implemented in the 2004 reading bridge assessment 94 

Figure A-2. 

Changes to the 1999 mathematics long-term trend assessment booklets 
implemented in the 2004 mathematics bridge assessment 96 

Figure A-3. 

Linking design for the long-term trend assessment: 2004 110






The long-term trend assessment 
has been measuring student 
progress in reading for 33 years 
and in mathematics for 31 years. 



Chapter 1 
Introduction 



The citizens and leaders of the United States have long valued education 
as a foundation for democracy, a resource for economic prosperity, and a 
means of realizing personal goals and individual potential. Throughout the 
nation’s history, the commitment to educate children has grown stronger
and more inclusive, and in recent decades, so has the expectation that our
nation’s schools and teachers be accountable (Ravitch 2002). In 2002 the
reauthorization of the Elementary and Secondary Education Act — also 
known as the No Child Left Behind (NCLB) Act — further expanded that 
commitment and expectation. 



As educators and policymakers turn their attention to student achievement
as measured by assessments, examining trends (student performance
now compared with the past) can inform efforts to increase student
performance in the future. The National Assessment of Educational Progress
(NAEP) is one of the most important resources for monitoring student
achievement. Since its inception in 1969, NAEP has served the important
function of measuring our nation’s educational progress by regularly
administering various subject-area assessments to nationally representative samples
student performance over time. This report presents the results of NAEP 
long-term trend assessments in reading and mathematics, which were 
administered in the 2003-2004 school year (referred to hereafter as 2004) 
to students ages 9, 13, and 17. Because the same assessments have been 
administered at different times in the 35-year history of NAEP, they make 
it possible to chart educational progress since 1971 in reading and 1973 in 
mathematics. 



The specific focus of this long-term trend report is to compare student 
performance in 2004 to past performance, measured by the most recent 
assessment in 1999 and previous assessments back to the early 1970s. 



NAEP Assessments 

NAEP is a project of the National Center for Education Statistics (NCES) 
within the Institute of Education Sciences of the U.S. Department of 
Education. The National Assessment Governing Board (NAGB), an inde- 
pendent group created by Congress in 1988, provides policy direction for 
NAEP. (Information about NAGB can be found on its website,
http://www.nagb.org/.)






NAEP includes two components: the long-term trend 
assessments and the main assessments. The existence 
of the two national assessment programs — long-term 
trend and main — makes it possible for NAEP to meet 
two important objectives. The long-term trend pro- 
gram uses substantially the same assessments decade 
after decade, each time a subject is assessed, in order to 
measure student progress in that subject over time. In 
contrast, the main NAEP assessments are periodically 
adapted to reflect contemporary curriculum policies, 
content currently in use in the nation’s schools, and
improvements in techniques of educational measure- 
ment. In this way, main NAEP can provide valid data 
for those seeking evidence for contemporary questions, 
and long-term trend NAEP can provide data for evalu- 
ating change over long periods. For example, while the 
current main NAEP reading assessment, given in 2005, 
was first administered in 1992, the long-term trend 
reading assessment dates back to 1971. 

This report presents the results from the long-term 
trend assessments only. Because the long-term trend 
assessments use different questions from those used in 
the main assessments, and because students are sampled 
by age for the long-term trend assessments, rather than 
by grade as in the main assessments, it is not possible to 
compare results from the two assessment programs. 



Overview of the 2004 Long-Term Trend 
Assessments 

The long-term trend assessment originally was given 
in four subjects: mathematics, science, reading, and 
writing. At the time of the last long-term trend report 
(1999), NAGB discontinued the assessment in writing 
for technical reasons. More recently, NAGB decided 
that changes were needed to the design of the science 
assessment and, given recent advances in the field of 
science, to its content. For instance, many science 
questions that were written in the late 1960s are no 
longer relevant, as they were first written before Neil 
Armstrong set foot on the moon, before computers 
could fit onto a desk, and without the knowledge of 
many medical and biotechnology breakthroughs of the
late 20th century. NAGB decided that the long-term
trend assessment in science required technical stud- 
ies of the required changes, so that valid comparisons 
between the updated assessment and the original assess- 
ment could still be made. To allow time to update the 
assessment and study the changes, the decision was 
made not to assess science in 2004. 

According to NAGB’s new policy, reading and mathematics
would continue to be assessed by the long-term
trend and main NAEP instruments, but science and 
writing would be assessed only in main NAEP. As a 
result, changes were needed to separate out the sets of 
questions (blocks) for science and writing, which had 
been intermixed with the reading and mathematics 
blocks in the long-term assessment instruments. New 
booklets consist only of reading or only of mathematics 
blocks. The changes provided an opportunity to bring 
other aspects of the assessment up to date. Considerable 
progress in testing theory has been made since the late 
1960s, and the 2004 administration provided a plat- 
form to bring these improvements to the long-term 
trend assessments, in areas such as scoring and scal- 
ing. In addition, main NAEP assessments had begun 
providing accommodations to allow students with dis- 
abilities and students who were not fluent in English 
to participate. In 2004, it became possible to implement
these modifications in the long-term trend assessments,
allowing a greater proportion of students to be assessed
through the use of accommodations.

Any time changes are made in a long-term trend 
assessment, studies are required to ensure that the 
results can continue to be reported on the same trend 
line — that is, that they are validly comparable to earlier 
results. So analyses were needed to ensure that the 2004 
results under the new design were comparable to the 
results from 1971 through 1999, under the design that 
existed earlier. Therefore, two assessments were con- 
ducted in 2004. The modified assessment used the 
new design, and the “bridge” assessment replicated the 
former design. Comparisons of the results can then 
detect any shifts in results due to changes in test design. 
This bridge assessment links the old assessments to the
new one.






2004 Bridge Study 

This section of the report presents a brief description 
of the 2004 bridge study, the modified assessment, 
and the long-term trend instruments. (More detailed 
information about the instruments and methodol- 
ogy is provided in appendix A.) The changes made 
for the modified 2004 assessment included replacing 
items containing outdated material, eliminating blocks 
of items for subjects no longer reported, replacing 
background questions, allowing accommodations for 
students who needed them, and changing some admin- 
istrative procedures. For example, previous long-term 
trend assessments in mathematics included an audio 
portion that paced students, so they were always at the 
same place in the test booklet at the same time. The 
audiotape was eliminated in the modified design so 
that students could move at their own pace within each 
section. Another example is that students used to have 
the option of selecting “I don’t know” as a response to 
a multiple-choice item. That response was eliminated 
in the modified assessment. Also, in prior assessments, 
the student’s race/ethnicity was reported based on a 
test administrator’s classification of the student’s visual 
appearance. In 2004, both schools and students were 
asked to report each student’s race/ethnicity as part 
of the school and student questionnaires. Finally, the 
2004 modified assessment provided for the inclusion of 
and accommodations for students with disabilities and 
English language learners. 

The changes were intended to improve the valid- 
ity of the results while continuing to maintain the 
integrity of the long-term trend. Thus, studies were
needed to ensure that the modifications did not affect
the interpretation of the results. In other words, it was 
important to assess whether any changes in scores were 
due to actual changes in student performance rather 
than changes in the assessments themselves that may 
have made them easier or harder. 

The bridge study was conducted to ensure that the 
interpretation of the assessment results remains con- 
stant over time. A bridge study involves developing two 
assessments: one that replicates the assessment given in 
the previous assessment year using the same questions 
and administration procedures (a bridge assessment), 
and one that represents the new design (a modified 
assessment). In 2004, students were randomly assigned 
to take either the bridge assessment or the modified 
assessment. The bridge assessment replicated the instru- 
ment given in 1999 and used the same administration 
techniques. The modified assessment included the new 
items and features discussed previously. This modified 
assessment will provide the basis of comparison for all 
future assessments, and the bridge will link its results 
back to the results of the past 30 years (see figure 1-1). 
Further detail on this study is provided in appendix A. 
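
The random-assignment logic of a bridge study can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the helper names (assign_forms, mean_gap), the simulated scores, and the fixed seeds are hypothetical and are not NAEP's actual procedures, which involve the sampling and weighting described in appendix A.

```python
import random
from statistics import mean

def assign_forms(student_ids, seed=0):
    """Randomly split students between the bridge and modified forms.

    Hypothetical helper: a simple shuffle-and-halve split, used here
    only to illustrate random assignment.
    """
    rng = random.Random(seed)
    ids = list(student_ids)
    rng.shuffle(ids)
    half = len(ids) // 2
    return {"bridge": ids[:half], "modified": ids[half:]}

def mean_gap(scores, groups):
    """Return mean(modified) - mean(bridge) for a dict of id -> score."""
    bridge = mean(scores[i] for i in groups["bridge"])
    modified = mean(scores[i] for i in groups["modified"])
    return modified - bridge

# Simulated scores that do not depend on the form taken (illustrative only).
rng = random.Random(1)
scores = {i: rng.gauss(250, 35) for i in range(2000)}
groups = assign_forms(scores.keys(), seed=42)

# Assignment is random and the form has no effect in this simulation,
# so the gap should be small relative to the spread of the scores.
gap = mean_gap(scores, groups)
print(f"modified - bridge = {gap:.2f}")
```

Because students are assigned at random, a gap between the two forms that is larger than chance would suggest points to the change in design rather than to differences between the students who took each form.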

This report will be the final report of new results 
acquired under the old design using the bridge assess- 
ment. The greater part of the report uses the results 
from the bridge assessment to maintain the trend lines 
from 1971 (in reading) and 1973 (in mathematics). 
Differences between the old and modified formats are 
discussed only in one chapter, chapter 5. Beginning in 
2008, only the modified design will be used, and the 
results will be linked back to the previous assessments 
through the 2004 bridge study. 



Figure 1-1. Comparison of the old and new long-term trend assessment

Old Long-Term Trend Assessment (Bridge & 1999)

• “I don’t know” option
• Observed race/ethnicity
• Audio-paced portion
• No accommodations for SD/ELL students

Modified Long-Term Trend Assessment

• No “I don’t know” option
• School-reported race/ethnicity
• Self-paced throughout each section
• Accommodations permitted







Content of the Assessments 

The content of the NAEP long-term trend reading 
and mathematics assessments has not changed since 
its beginning. The reading assessment contains a range 
of reading materials, from simple narrative passages to 
complex articles on specialized topics. The selections 
include stories, poems, essays, reports, and passages 
from textbooks, as well as a sample train schedule, 
telephone bill, and advertisements. Students’ com- 
prehension of these materials is assessed with both 
multiple-choice questions, for which students choose 
a response from a list, and constructed-response ques- 
tions, for which students are asked to write a response. 

The long-term trend mathematics assessment mea- 
sures students’ knowledge of basic facts, their ability to 
carry out numerical algorithms using paper and pencil, 
their knowledge of basic measurement formulas as they 
are applied in geometric settings, and their ability to 
apply mathematics to daily-living skills (such as those 
related to time and money). The computational focus 
of the long-term trend assessment provides a unique 
opportunity to measure how students perform in tradi- 
tional procedural skills. 

The Long-Term Trend Background 
Questionnaires 

In addition to assessing students’ progress in reading 
and mathematics, the NAEP long-term trend assess- 
ments include questions about students’ home and 
school experiences that may be related to educational 
achievement. For example, students are asked about the 
courses they have taken, activities in their classrooms, 
the amount of time they spend on homework, and 
educationally relevant uses of their time out of school. 
Their responses to these questions provide an informa- 
tive context for interpreting the assessment results. 

In the previous long-term trend assessments, these 
background questions were intermixed with the assess- 
ment questions. For example, students would answer 
questions about a reading passage to assess their understanding
of that passage, and then they would respond
to background questions about their reading habits. In 
the modified design, these background questions were 
reduced in number and assembled together in a sepa- 
rate section that students completed after finishing the 
assessment. 

The Student Sample 

The NAEP long-term trend assessments measure the 
performance of students at three ages — 9, 13, and 17. 
The NAEP assessments measure the achievement of 
students nationally and are not intended to provide a 
measure of individual student performance. A nation- 
ally representative sample of students is selected, and 
their results are generalized to the nation as a whole. 
Small percentages of students with disabilities (SD) 
and of English language learners (ELL) are excluded in 
each assessment year based on their schools’ judgment 
that they cannot be meaningfully assessed. Formerly, 
NAEP did not permit students so identified to receive 
accommodations (such as extended time, assessment 
in small groups, or use of bilingual dictionaries). In 
2004, accommodations were permitted on the modified 
assessment, and therefore fewer students were excluded. 
Specifically, approximately 14 to 19 percent of students 
across the three ages and two subjects were identified as 
SD/ELL in 2004, resulting in an exclusion rate of 7 to 
8 percent, depending on the age and subject assessed, 
in the nonaccommodated format. When accommoda- 
tions were permitted, the exclusion rates dropped to 
approximately 3 percent for mathematics and 4 to 3 
percent for reading. (See appendix A for information 
regarding exclusion criteria and exclusion rates.) 

This report contains results representing the perfor- 
mance of all in-school 9-, 13-, and 17-year-olds in the 
nation who are capable of being meaningfully assessed 
without accommodations, except for the results from 
the modified assessment shown in chapter 5. In addi- 
tion, it describes the performance of groups of students, 
such as males and females, in each age group. In 2004, 
more than 11,000 students at each of the three ages
were assessed in each subject area, including both
public and private school students. To ensure that the
sample was nationally representative, a sampling plan 
was created to randomly select schools and students to 
participate. This sampling plan targeted certain schools 
and students for participation in NAEP. The degree 
to which the students who actually participated in the 
assessment matched the target is a measure of the reli- 
ability of the results. In 2004, approximately 80 to 81 
percent of the students originally selected for the assess- 
ment at age 9 were actually assessed, 76 to 77 percent 
of the students at age 13, and 55 to 57 percent at age 
17. (See appendix A for more information on sampling 
procedures and appendix B for the percentages of stu- 
dents in various reporting groups who were assessed.) 

Reporting the Trend Results 

Students’ performance on the long-term trend assess- 
ments is summarized on a 0-500 scale for each subject 
area. For each year in which the assessments were 
administered, achievement in a particular subject area is 
described for a group of students by their average scale 
score and the score at the selected percentiles. Trends in 
student achievement are determined by examining the 
average scale scores attained by students in the current 
assessment year or the score at the selected percentiles 
and comparing them to the same scores in other assess- 
ment years. While the score ranges in both subjects are 
identical, the scale was derived independently for each 
subject. Therefore, average scale scores between subjects 
cannot be compared. 

In addition to reporting average scores, student per- 
formance is described in terms of the percentages of 
students attaining specific levels of performance. These 
performance levels correspond to five points on the 
reading and mathematics scales: 150, 200, 250, 300, 
and 350. For each subject area, the performance 
levels from lowest to highest are associated with 
increasingly advanced skills and knowledge (Allen, 
McClellan, and Stoeckel 2005, pp. 21-22). Examining 
the percentages of students in each year that attained 
each performance level provides additional insight into 
student achievement. 
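
The "percentage at or above" metric can be sketched in a few lines. The example below is a simplified illustration, not NAEP's estimation procedure: it assumes each student has a single score and a sampling weight, and the function name and data values are made up. Only the five cut points come from the text above.

```python
LEVELS = (150, 200, 250, 300, 350)  # cut points named in the report

def percent_at_or_above(scores, weights, levels=LEVELS):
    """Weighted percentage of students scoring at or above each level."""
    total = sum(weights)
    result = {}
    for level in levels:
        at_or_above = sum(w for s, w in zip(scores, weights) if s >= level)
        result[level] = 100.0 * at_or_above / total
    return result

# Made-up scores and sampling weights for five students (illustrative only).
scores = [140, 210, 255, 305, 360]
weights = [1.0, 2.0, 3.0, 2.0, 2.0]

pct = percent_at_or_above(scores, weights)
for level, value in pct.items():
    print(f"at or above {level}: {value:.1f}%")
```

The levels are cumulative: a student counted at or above 300 is also counted at or above 250, so the percentages can only stay the same or fall as the level rises.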



Because the results presented in this report are based 
on a nationally representative sample of students, they 
are considered estimates of all students’ average perfor- 
mance (excluding students who cannot be meaningfully 
assessed). As such, the results are subject to a degree of 
uncertainty, which is reflected in the standard errors of 
the estimates. The standard errors for all of the scale 
scores and percentages presented in this report can be 
viewed using the NAEP Data Explorer found at
http://nces.ed.gov/nationsreportcard/naepdata/. Statistical
tests that take into account these standard errors were 
conducted to determine whether apparent changes 
or differences in the results are measurably different 
in a statistical sense. When the term “significant” is 
used, it does not imply a judgment about the absolute 
magnitude or educational relevance of changes and 
differences in student performance. Rather, it is used 
to indicate that the observed changes are not likely to 
be due to chance factors associated with sampling and 
measurement error. The differences described in this 
report have been determined to be statistically signifi- 
cant at the 0.05 level with appropriate adjustments for 
multiple comparisons. In the tables and charts in this 
report, the symbol (*) is used to indicate that a score or 
percentage is measurably different from another. (See 
appendix A for additional information on analysis 
procedures.) 
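
A minimal sketch of this kind of testing is shown below. It rests on several assumptions: the two estimates being compared come from independent samples, a normal approximation holds, and the multiple-comparison adjustment is a Benjamini-Hochberg style False Discovery Rate procedure. The function names and all numeric values are made up for illustration. Appendix A describes the procedures actually used.

```python
import math

def z_test_p_value(est_a, se_a, est_b, se_b):
    """Two-sided p-value for the difference of two independent estimates."""
    # The standard error of a difference combines both standard errors.
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    z = (est_a - est_b) / se_diff
    # Two-sided normal tail probability via the complementary error function.
    return math.erfc(abs(z) / math.sqrt(2))

def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the comparison is significant."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose sorted p-value is <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            significant[idx] = True
    return significant

# Made-up (estimate, standard error) pairs for three comparisons.
comparisons = [
    ((210.0, 1.0), (204.0, 1.0)),
    ((250.0, 1.0), (250.0, 1.0)),
    ((280.0, 1.5), (283.0, 1.5)),
]
p_values = [z_test_p_value(a, sa, b, sb) for (a, sa), (b, sb) in comparisons]
flags = benjamini_hochberg(p_values)
for p, flag in zip(p_values, flags):
    print(f"p = {p:.4f}  significant: {flag}")
```

The sketch also shows why larger standard errors make a difference harder to detect: the third comparison has a 3-point difference, but its combined standard error is large enough that the difference is not flagged.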

The results presented here are meant to describe some 
aspects of the condition of education. They are best 
viewed as suggesting various ideas to be further exam- 
ined in light of other data and in the context of the 
large research literature elaborating on the many factors 
contributing to educational achievement. 

About This Report 

This report describes trends in 9-, 13-, and 17-year-olds’
achievement in reading and mathematics during
the last three decades. Chapter 2 presents trends in
terms of overall scale scores, percentiles, and percentages
at selected performance levels for the nation. Chapter
3 examines trends in average scale scores for groups 
of students defined by gender, race/ethnicity, and the
education level of the students’ parents. Chapter 4
reports results from the NAEP long-term trend back- 
ground questionnaires. In this chapter, students’ school 
and home experiences, as shown in their responses to 
the background questions, are examined in relation to 
students’ assessment scores. Chapter 5 explores the dif- 
ferences between the bridge assessment administered 
under the procedures used for earlier assessments and 
the modified assessment with the new design elements. 
The last chapter in this report provides sample items 
from the NAEP long-term trend assessments. For the 
first time, NCES is releasing items from the assess- 
ment, along with summary data that indicate how 
well students performed on these items. This report 
also contains three appendixes. Appendix A discusses 
technical procedures involved in collecting, analyzing, 
and reporting the assessment data, and appendix B is a 
data appendix showing the percentages of participating 
students in the bridge and modified samples by student 
groups. Appendix C provides a glossary of terms used 
in this report. 

Additional information about the 2004 long-term 
trend assessments not included in this report, and other 
NAEP assessment reports and data, are available on the 
Internet at http://nces.ed.gov/nationsreportcard/. This
site contains the data associated with all the figures in 
this report and further information on the technical 
features of the study. Additional data, such as the stan- 
dard errors for each percentage, can also be found on 
this website. 



Cautions in Interpreting the 
Long-Term Trend Results 

The reader is cautioned against using the long-term 
trend results in this report to make simple causal infer- 
ences related to student performance, to the relative 
effectiveness of public and nonpublic schools, or to 
other educational variables discussed in this report. 
Simple cross-tabulations of a variable with measures of 
educational achievement, like the ones presented here, 
cannot constitute proof that differences in the variable 
cause differences in educational achievement. There 
are many possible reasons why the performance of one 
group of students will differ from that of another that 
are not discussed in this report. For example, group dif- 
ferences may be understood better by considering such 
factors as exposure to a rigorous curriculum, variations 
in course-taking patterns, and parental involvement. 

A caution is also warranted for some small population 
group estimates. Smaller population groups may show 
increases or decreases across years in average scores; 
however, it is necessary to interpret such score changes 
with extreme caution. The effects of exclusion-rate 
changes for groups of students may be more marked for 
small groups than they are for the whole population. 
Another reason for caution is that the standard errors 
are often quite large around the score estimates for 
small groups, which in turn means the standard error 
around the gain is also large. 
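
As a rough illustration of this last point, the sketch below shows how the standard error of a score change grows with the standard errors of the two estimates being compared. It assumes the two assessment samples are independent, so that the squared standard errors add. The function name and the standard errors used are hypothetical and are not taken from this report; the procedures actually used for NAEP are described in appendix A.

```python
import math


def gain_standard_error(se_earlier, se_later):
    """Standard error of the difference between two independent estimates."""
    return math.sqrt(se_earlier ** 2 + se_later ** 2)


# Hypothetical standard errors, chosen only for illustration.
whole_population = gain_standard_error(1.0, 1.0)
small_group = gain_standard_error(4.0, 4.0)

print(f"whole population: {whole_population:.2f}")
print(f"small group:      {small_group:.2f}")
```

Under this assumption, a group whose score estimates are four times less precise also has a gain estimate that is four times less precise, which is why small-group changes call for extra caution.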

In addition, although in some figures trend lines for 
ages 9, 13, and 17 will appear in the same graphic, the 
reader is cautioned against making cohort comparisons. 
One cannot interpret the amount of growth between 
ages 9 and 13 from these figures by examining a 4-year 
time difference. Not all assessment years are four years 
apart, and the assessments were administered at differ- 
ent times of the year for the different ages. The relative 
merits of different types of comparisons are discussed 
in appendix A. Comparisons should be made within 
ages only. 
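
The uneven spacing of assessment years can be checked with a few lines of arithmetic. The sketch below uses the reading assessment years shown on the horizontal axis of figure 2-1; the variable names are illustrative only.

```python
# Reading assessment years, as listed on the axis of figure 2-1.
reading_years = [1971, 1975, 1980, 1984, 1988, 1990, 1992, 1994, 1996, 1999, 2004]

# Number of years between each pair of consecutive assessments.
gaps = [later - earlier for earlier, later in zip(reading_years, reading_years[1:])]

print(gaps)
# [4, 5, 4, 4, 2, 2, 2, 2, 3, 5]
```

Only three of the ten intervals are exactly four years long, so the trend lines for ages 9 and 13 cannot be read as following a single cohort of students.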



NAEP 2004 TRENDS IN ACADEMIC PROGRESS 



National results are displayed
using three reporting metrics:
average scale scores, percentiles,
and performance levels.
Generally, all three metrics show
improvements at age 9 in reading
and mathematics.



Chapter 2 

National Trends in Academic 
Achievement 



For the past 35 years, NAEP’s long-term trend assessments have documented
trends in the academic achievement of America’s students. Before
the 2004 assessment, the last long-term trend assessment was conducted in 
1999. This report examines the changes in students’ performance in read- 
ing and mathematics over the past five years by comparing 2004 results to 
1999 results and then provides a wider view of the overall trends in perfor- 
mance from the early 1970s through 2004. 



This chapter presents the results by subject, first examining the trends 
in reading and then discussing mathematics results. There have been 11
administrations of the reading assessment since 1971 and 10 administra- 
tions of the mathematics assessment since 1973 for ages 9, 13, and 17. 
The next section describes the different ways of reporting results, and the 
remainder of this chapter describes the national trends in reading and 
mathematics. 






How the Results Are Presented 

Performance results in this chapter are reported in three 
ways: as average scale scores, as percentile scores, and as 
percentages of students reaching predetermined perfor- 
mance levels. 

► Average scale scores. The average scale scores repre- 
sent the performance of 9-, 13-, and 17-year-olds in 
reading or mathematics averaged across the nation. 
Student performance is summarized on a 0-500 
scale for both reading and mathematics, where the 
different points on the scale represent what students 
know and can do at a given point in time. Although 
the results from both subjects are reported on the 
same scale, the results cannot be compared with one 
another, as they measure different content. 

Line graphs are provided to depict student perfor- 
mance on this scale across the years in both subject 
areas. The average scale score attained by students in 
each assessment year is indicated on the graph. The 
average scores for years prior to 2004 are highlighted 
with an asterisk (*) when the score is significantly 
higher or lower than the average score in 2004. (See 
appendix A for information on the statistical tests 
conducted.) 

► Percentile scores. Going beyond average scores, use- 
ful information can be gained by examining trends 
of student scores falling at specified percentiles along 
the performance distribution. Percentiles indicate 
the percentage of students whose scores fell below
a particular point on the NAEP scale. For example,
25 percent of assessed students’ scores fell below the
25th percentile score; 75 percent fell below the 75th
percentile score. This chapter provides such information
by examining the scores of students at five
distinct percentiles (10th, 25th, 50th, 75th, and 90th)
of the score distribution in each year. Examining 
student performance at different percentiles on the 
0-500 scale indicates whether or not the changes 
seen in the overall national average score results are 
reflected in the performance of lower-, middle-, and 
higher-performing students. 

► Performance levels. More detailed information about 
what students know and can do in each subject area 
can be gained by examining their attainment of 
specific performance levels in each assessment year. 
For each of the subject area scales, performance levels 
were set at 50-point increments from 150 through 
350. The five performance levels — 150, 200, 250, 
300, and 350 — were then described in terms of the 
knowledge and skills likely to be demonstrated by 
students who reached each level. To develop these 
descriptions, assessment questions were identified 
that students at a particular performance level were 
more likely to answer successfully than students 
at lower levels. The descriptions of what students 
know and can do at each level are based on these 
sets of questions. This process of developing the 
performance-level descriptions is quite different from
that used to develop achievement-level descriptions
in the main NAEP reports, because the performance
levels are not set through a judgmental process. The
levels for long-term trends were set arbitrarily and do
not represent performance standards. Specific descriptions for each
subject are presented later in this chapter along 
with the results. (The procedures for describing the 
performance levels are discussed in more detail in 
appendix A.) 
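
The percentile metric described above can be sketched as follows. This is a simplified illustration that applies the nearest-rank method to a small set of made-up, unweighted scores. The function names and the data are hypothetical, and the sketch does not reproduce the estimation procedures that NAEP applies (see appendix A).

```python
def percentile_score(scores, pct):
    """Return the nearest-rank percentile score for an integer pct (1-100)."""
    ordered = sorted(scores)
    # Ceiling of pct * n / 100, computed with integers (1-based rank).
    rank = (pct * len(ordered) + 99) // 100
    return ordered[max(rank, 1) - 1]


def percent_below(scores, cut_point):
    """Percentage of scores that fall strictly below cut_point."""
    return 100.0 * sum(1 for s in scores if s < cut_point) / len(scores)


# Hypothetical scale scores for 20 students (not NAEP data).
scores = [168, 175, 183, 190, 196, 201, 205, 210, 214, 219,
          223, 227, 232, 236, 241, 247, 252, 258, 266, 281]

# The five percentiles reported in this chapter.
for pct in (10, 25, 50, 75, 90):
    print(pct, percentile_score(scores, pct))
```

Tracking these five values from one assessment year to the next shows whether a change in the average came mainly from lower-, middle-, or higher-performing students.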






National Trends in Reading 
Performance 

National trends are shown through the average score, 
the percentile scores, and the percentage of students 
at or above each performance level. Although at first 
glance it may appear that this report provides the same 
results in three formats, these different reporting met- 
rics actually provide different perspectives. The average 
score summarizes student performance in one measure. 
The percentiles examine performance at five different 
points, demonstrating whether any changes in aver- 
age score are more likely due to changes in the scores 
of lower-performing students or higher-performing 
students. These percentiles are based on a normative 
measure, while the performance levels are based on a
criterion measure. That is, the performance levels show
trends in student performance at five benchmarks. 
These benchmarks are valid within all three age groups, 
permitting comparisons of the attainment of absolute 
performance levels over time. Cross-age comparisons 
can be supported, but readers are encouraged to focus 
more appropriately on within-grade comparisons. 

Overall, the national trend in reading shows improve- 
ment across most reporting metrics at age 9 between 
1999 and 2004 as well as between 1971 and 2004. 
Students at age 13 show no significant improvement in 
recent years, although most reporting metrics indicate 
that performance in 2004 was higher than in 1971. At 
age 17, no measurable differences in performance were 
found between 1971 and 2004 for any reporting metric. 






Average Scores 

This measure provides a summary account of student 
performance. Figure 2-1 displays the trend lines for 
each age, and further details are given below. 

Nine-year-olds. The average reading score at age 9 was
higher in 2004 than in any previous assessment year. 

Thirteen-year-olds. The average score at age 13 was 
higher in 2004 than in 1971, but not measurably dif- 
ferent from the average score in 1999. 

Seventeen-year-olds. Between 1999 and 2004, aver- 
age reading scores at age 17 showed no measurable 
changes. The average score in 2004 was similar to that 
in 1971. 



How to interpret this graphic . . . 

Graphics like these show the average scale score at 
each age for each year the assessment was given. 

Each score is plotted, and lines are drawn to connect 
the scores between the different years, creating trend 
lines. Examining the trend lines helps to determine 
whether scores appear to be increasing over time, or 
if there are any peaks or valleys in the 33-year trend. 
Statistically significant differences in scores between 
2004 and previous years are marked with an aster- 
isk. For example, figure 2-1 shows the trend lines of 
the average scores in reading for all three ages. The 
graphic shows that the average score at age 17 was 
about the same in 1971 as in 2004. 



Figure 2-1. Trends in average reading scale scores for students ages 9, 13, and 17: 1971-2004 



[Figure 2-1: line graph with one trend line each for ages 9, 13, and 17. Vertical axis: scale score (0-500). Horizontal axis: assessment years 1971, 1975, 1980, 1984, 1988, 1990, 1992, 1994, 1996, 1999, and 2004.]



*Significantly different from 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 






Percentile Scores 

Examining the national trends at five percentiles shows 
whether changes seen in the national averages were sus- 
tained at every level of performance or were more likely 
to occur for students of specific ability levels. Figure 
2-2 displays trends in reading scores for 9-, 13-, and 
17-year-old students in the five percentile ranges. The 
results are discussed below for each age level. 

Nine-year-olds. As seen in figure 2-2, only one significant
increase was seen at the 90th percentile as
compared to 2004. However, the score at the 50th
percentile, the median, was higher in 2004 than in
any other assessment year. The scores at the 10th, 25th,
and 75th percentiles showed increases in performance
between 1999 and 2004 and between 1971 and 2004. 

Thirteen-year-olds. The trends differ between upper and
lower percentiles. The scores at the 10th, 25th, and 50th
percentiles showed no measurable differences between
2004 and any previous assessment year. At the 75th and
90th percentiles, scores in 2004 were higher than in
1971, although no measurable differences were detected
between the score in 2004 and that in 1999. 

Seventeen-year-olds. Examining the scores at the five 
selected percentiles shows no measurable difference in 
the scores in 2004 compared to either 1971 or 1999. 



How to interpret this graphic . . . 

Graphics like figure 2-2 show the score at each per- 
centile for five selected percentiles. For example, at 
age 9 in 2004, students at the 10th percentile scored
169 in reading, while students at the 90th percentile
scored 264. Looking at the five trend lines together,
it can be determined if more improvement took place
at the upper end or at the lower end, or if the trend
lines look the same at all five levels. For example, at
age 9, the scores at the 10th, 25th, 50th, and 75th
percentiles showed increases in performance between 
1999 and 2004 and between 1971 and 2004. 



Figure 2-2. Trends in reading scale score at selected percentiles for students ages 9, 13, and 17: 1971-2004 



[Figure 2-2: three line-graph panels (Age 9, Age 13, and Age 17), each with trend lines for the 10th, 25th, 50th, 75th, and 90th percentiles. Vertical axis: scale score (0-500). Horizontal axis: assessment years 1971 through 2004.]



*Significantly different from 2004.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 










Performance Levels 

This section reports trend results using the performance- 
level reporting metric, examining the percentage of 
students demonstrating particular levels of performance 
over the past three decades. Although one would expect 
these trends to follow closely the trends in average 
scores, it is instructive to examine changes in what stu- 
dents now seem to know and be able to do. 

The skills and abilities demonstrated by students at 
each reading performance level are described below. 

The five performance levels are applicable at all three 
age groups, although the likelihood of attaining higher 
performance levels is directly related to a student’s age,
because older students have completed more education
in both subject areas. For this reason, only three performance
levels are discussed for each age: levels 150,
200, and 250 for age 9; levels 200, 250, and 300 for 
age 13; and levels 250, 300, and 350 for age 17. One 
might expect younger students to reach only the first 
performance levels, as they have not yet been taught 
the material in the higher performance levels, and it is 
expected that nearly 100 percent of older students 
will meet the lowest performance levels. Thus, the 
performance-level results displayed for each age are 
those that are most likely to show significant change 
across the assessment years. The levels not shown here 
are those that nearly all or almost no students attained 
at a particular age in each year. 



Reading Performance-Level Descriptions 

LEVEL 350: Learn from Specialized Reading Materials 

Readers at this level can extend and restructure the ideas presented in specialized and complex texts. Examples include scientific 
materials, literary essays, and historical documents. Readers are also able to understand the links between ideas, even when 
those links are not explicitly stated, and to make appropriate generalizations. Performance at this level suggests the ability to 
synthesize and learn from specialized reading materials. 

LEVEL 300: Understand Complicated Information 

Readers at this level can understand complicated literary and informational passages, including material about topics they study 
at school. They can also analyze and integrate less familiar material about topics they study at school as well as provide reac- 
tions to and explanations of the text as a whole. Performance at this level suggests the ability to find, understand, summarize, and 
explain relatively complicated information. 

LEVEL 250: Interrelate Ideas and Make Generalizations 

Readers at this level use intermediate skills and strategies to search for, locate, and organize the information they find in relatively 
lengthy passages and can recognize paraphrases of what they have read. They can also make inferences and reach generaliza- 
tions about main ideas and author’s purpose from passages dealing with literature, science, and social studies. Performance at 
this level suggests the ability to search for specific information, interrelate ideas, and make generalizations. 

LEVEL 200: Demonstrate Partially Developed Skills and Understanding 

Readers at this level can locate and identify facts from simple informational paragraphs, stories, and news articles. In addition, 
they can combine ideas and make inferences based on short, uncomplicated passages. Performance at this level suggests the 
ability to understand specific or sequentially related information. 

LEVEL 150: Carry Out Simple, Discrete Reading Tasks 

Readers at this level can follow brief written directions. They can also select words, phrases, or sentences to describe a simple 
picture and can interpret simple written clues to identify a common object. Performance at this level suggests the ability to carry 
out simple, discrete reading tasks. 






Figure 2-3 shows the percentage of students reaching 
each performance level by age and assessment year. The 
following sections discuss the data for each age. It is 
important to keep in mind that the percentages report- 
ed for each level are cumulative. That is, the percentage 
shown for level 200 reflects the percentage of students 
who scored at 200 or above, so it also includes those 
who scored at 250, 300, or 350. 



How to interpret this graphic . . . 

Bar charts are used to show the percentage of students 
who reach each performance level or above. For instance, 
figure 2-3 shows that 80 percent of 17-year-olds in 2004 
reached level 250 or above, 38 percent reached level 300 
or above, and 6 percent reached level 350. So, the 80 
percent bar also includes those students in the 38 and 
6 percent bars. Examining the height of the bars across 
years can help determine whether students are improving 
at the lower levels, higher levels, or both. 
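
Because the percentages are cumulative, the share of students falling between two adjacent levels can be obtained by subtraction. The sketch below does this for the 2004 percentages for 17-year-olds quoted above (80, 38, and 6 percent). The function name is hypothetical, and because the published percentages are rounded, the resulting shares are approximate.

```python
def band_shares(at_or_above):
    """Convert cumulative 'at or above' percentages into non-overlapping bands."""
    levels = sorted(at_or_above)
    bands = {f"below {levels[0]}": 100 - at_or_above[levels[0]]}
    for lower, upper in zip(levels, levels[1:]):
        bands[f"{lower} to below {upper}"] = at_or_above[lower] - at_or_above[upper]
    bands[f"{levels[-1]} or above"] = at_or_above[levels[-1]]
    return bands


# Reading, age 17, 2004: percent at or above levels 250, 300, and 350.
age17_2004 = {250: 80, 300: 38, 350: 6}

print(band_shares(age17_2004))
# {'below 250': 20, '250 to below 300': 42, '300 to below 350': 32, '350 or above': 6}
```

The four shares sum to 100 percent, whereas the three cumulative percentages in the bar chart do not, because each bar includes the students counted in the bars for higher levels.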



Figure 2-3. Trends in percentages at or above reading performance levels for students ages 9, 13, and 17: 1971-2004 



[Figure 2-3: three bar-chart panels showing the percentage of students at or above each reading performance level by assessment year, 1971 through 2004: Age 9 (levels 150, 200, and 250), Age 13 (levels 200, 250, and 300), and Age 17 (levels 250 and 300 or above, and level 350).]



*Significantly different from 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 



Nine-year-olds. Trends in the percentage of 9-year-olds
scoring at or above reading performance levels 150,
200, and 250 are shown in the first panel of figure
2-3. In each assessment year, at least 90 percent of 9- 
year-olds performed the simple, discrete reading tasks 
described at level 150. In 2004, 96 percent of 9-year- 
olds reached level 150, a higher percentage than in any 
previous assessment year. The partially developed skills 
and understanding associated with level 200 were dem- 
onstrated by 70 percent of 9-year-olds in 2004. This 
number was higher than in every other assessment year 
with the exception of 1980, which showed no measur- 
able difference from 2004. The ability to interrelate 
ideas and make generalizations (level 250) was dem- 
onstrated by 20 percent of 9-year-olds in 2004. This 
percentage was higher than both the more recent assess- 
ment year, 1999, and the first assessment year, 1971. 

Thirteen-year-olds. The second panel of figure 2-3 
displays trends in the percentage of 13-year-olds per- 
forming at or above reading performance levels 200, 
250, and 300. In each assessment year, 92 percent or 
more of 13-year-old students performed at or above 
level 200, demonstrating at least partially developed 
skills and understanding. Ninety-four percent of 
students reached level 200 in 2004, which was not 
measurably different from the percentage in any other
assessment year, except 1994, when the percentage
fell to 92 percent. The ability to interrelate ideas and 
make generalizations (level 250) was demonstrated 
by 61 percent of 13-year-olds in 2004. Despite some 
apparent fluctuation, no measurable differences were 
found in the percentages of students at or above this 
level of performance across the assessment years. At 
level 300, students demonstrate the ability to under- 
stand complicated literary and informational passages. 
The percentage of students reaching level 300 in 2004 
was higher than the percentage in 1971, mirroring the 
national trend for average score. 

Seventeen-year-olds. Trends in the percentage of 17- 
year-olds scoring at or above reading performance 
levels 250 and 300 and at level 350 are shown in the 
last panel of figure 2-3. The ability to interrelate ideas 
and make generalizations (level 250) was demonstrated 
by 80 percent of 17-year-olds in 2004, which was not 
measurably different from 1999 or 1971. Performance 
at or above level 300 — understanding complicated 
information — was demonstrated by 38 percent of 17- 
year-olds in 2004, which was not measurably different 
from the percentages in 1999 or 1971. Across all of the 
assessment years, only 5 to 7 percent of 17-year-olds 
demonstrated performance at level 350 — the ability to 
learn from and synthesize specialized reading materials. 






National Trends in Mathematics 
Performance 

Overall, the national trend in mathematics shows 
improvement in performance at ages 9 and 13 in 2004 
and few changes over the years at age 17. Note that the 
data from 1973 in figure 2-4 were extrapolated using 
a mean proportion correct, meaning that only average 
scores could be calculated. Results by percentile and 
performance levels are shown from 1978 through 2004. 
(See appendix A for further explanation of the extrapo- 
lated results.) The following sections examine the 
national results through the average score, the percen- 
tile scores, and the percentage of students at or above 
each performance level. 



Average Scores 

The first set of results shows trends in average scores in 
mathematics between 1973 and 2004. Figure 2-4 displays 
the trend lines for each age, and further details follow. 

Nine-year-olds. At 241, the average score at age 9 was
higher in 2004 than in any previous year — up 9 points 
from 1999 and 22 points from 1973. 

Thirteen-year-olds. At age 13, the average score in 2004 
was higher than in any other assessment year. The 5- 
point increase between 1999 and 2004 resulted in an 
average score in 2004 that was 15 points higher than
the average score in 1973. 

Seventeen-year-olds. The average score at age 17 was 
not measurably different from the average score in 
1973 or 1999. 






Figure 2-4. Trends in average mathematics scale scores for students ages 9, 13, and 17: 1973-2004 



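[Figure 2-4: line graph with one trend line each for ages 9, 13, and 17. Vertical axis: scale score. Horizontal axis: assessment years 1973 through 2004.]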



*Significantly different from 2004. 

NOTE: Dashed lines represent extrapolated data. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1973-2004 Long-Term Trend Mathematics Assessments. 



How to interpret this graphic . . . 

Graphics like these show the average scale score at 
each age for each year the assessment was given. Each 
score is plotted, and lines are drawn to connect the 
scores between the subsequent assessment years, creating
trend lines. Examining the trend lines helps to
determine whether scores appear to be increasing
over time, or if there are any peaks or valleys in the
31-year trend. Statistically significant differences in
scores between 2004 and previous years are marked 
with an asterisk. For example, figure 2-4 shows that 
at age 17, the average score in 2004 was not measur- 
ably different from the scores shown in 1990 through 
1999, but it was higher than the scores in 1978, 
1982, and 1986. 






Percentile Scores 

This section examines the national trends at five 
percentiles to indicate whether changes seen in the 
national averages are sustained at every level of perfor- 
mance or occurred for students of specific ability levels. 
Figure 2-5 displays trends in mathematics scores for 
9-, 13-, and 17-year-old students in the five percentile 
levels. Note that these trends are not available back to 
1973 because only the overall average scores could be 
extrapolated for 1973. 

Nine-year-olds. The trend lines shown in figure 2-5
appear very similar to one another at age 9. Nine-year-olds
showed higher scores at each of the five selected
percentiles in 2004 than in any other assessment year.
Between the first year and the most recent assessment
year (1978 and 2004), scores increased 26 points at
the 10th percentile, 26 points¹ at the 25th percentile,
23 points at the 50th percentile, 21 points¹ at the 75th
percentile, and 18 points at the 90th percentile.

Thirteen-year-olds. At age 13, the score at each of the
five percentile levels was higher in 2004 than in every
previous assessment year, with the exception of the 10th
percentile. The score at the 10th percentile in 2004 was
higher than in 1978, but showed no measurable gain
between 1999 and 2004.

Seventeen-year-olds. Scores for 17-year-olds at the 10th,
25th, and 50th percentiles were higher in 2004 than in
1978. The scores at the 75th and 90th percentiles were
not measurably different in 2004 compared to 1999 or 
1978. 



How to interpret this graphic . . . 

Graphics like figure 2-5 show the score at each percentile
for five selected percentiles. For example,
at age 9 in 2004, students at the 10th percentile
scored 197 in mathematics, while students at the
90th percentile scored 282. Both of these scores are
higher than the scores in any previous assessment year.
Looking at the five trend lines together, it can be
determined if more improvement took place at the 
upper end or at the lower end, or if the trend lines 
look the same at all five levels. 



¹ Detail may not sum to totals because of rounding. Differences between scores are calculated using unrounded values. In this instance, the result of the
subtraction differs from what would be obtained by subtracting the rounded values shown in the accompanying figure. 
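
The effect described in this note can be reproduced with a small sketch. The unrounded scores below are hypothetical values chosen only to show the mechanism; they are not the unrounded NAEP estimates, and the function name is illustrative.

```python
def compare_gains(earlier, later):
    """Return (difference of rounded scores, rounded difference of unrounded scores)."""
    difference_of_rounded = round(later) - round(earlier)
    rounded_difference = round(later - earlier)
    return difference_of_rounded, rounded_difference


# Hypothetical unrounded scores: 218.6 displays as 219, and 244.4 displays as 244.
print(compare_gains(218.6, 244.4))
# (25, 26)
```

Subtracting the displayed values gives 25 points, while the change computed from the unrounded values rounds to 26 points, the kind of one-point discrepancy the note refers to.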






Figure 2-5. Trends in mathematics scale score at selected percentiles for students ages 9, 13, and 17: 1978-2004 




[Figure 2-5: line-graph panels by age (9, 13, and 17), each with trend lines for the 10th, 25th, 50th, 75th, and 90th percentiles. Vertical axis: scale score. Horizontal axis: assessment years 1978 through 2004.]



*Significantly different from 2004.

NOTE: Mathematics scores at selected percentiles are not available in 1973 because only the overall average scores were extrapolated for this year. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1978-2004 Long-Term Trend Mathematics Assessments. 










Performance Levels 

The skills and abilities demonstrated by students at each 
mathematics performance level are described below. As 
in reading, the five performance levels are applicable 
at all three ages, but only three performance levels are 
discussed for each age: levels 150, 200, and 250 for age 
9; levels 200, 250, and 300 for age 13; and levels 250,
300, and 350 for age 17. These performance levels are
the ones most likely to show significant change within 
an age across the assessment years and do not include 
the levels that nearly all or almost no students attained 
at a particular age in each year. Again, these trends are 
only available from 1978, because only the overall aver- 
age scores could be extrapolated for 1973. 



Mathematics Performance-Level Descriptions 

LEVEL 350: Multistep Problem Solving and Algebra 

Students at this level can apply a range of reasoning skills to solve multistep problems. They can solve routine problems involving 
fractions and percents, recognize properties of basic geometric figures, and work with exponents and square roots. They can solve 
a variety of two-step problems using variables, identify equivalent algebraic expressions, and solve linear equations and inequali- 
ties. They are developing an understanding of functions and coordinate systems. 

LEVEL 300: Moderately Complex Procedures and Reasoning 

Students at this level are developing an understanding of number systems. They can compute with decimals, simple fractions, 
and commonly encountered percents. They can identify geometric figures, measure lengths and angles, and calculate areas of 
rectangles. These students are also able to interpret simple inequalities, evaluate formulas, and solve simple linear equations. 
They can find averages, make decisions based on information drawn from graphs, and use logical reasoning to solve problems. 
They are developing the skills to operate with signed numbers, exponents, and square roots. 

LEVEL 250: Numerical Operations and Beginning Problem Solving 

Students at this level have an initial understanding of the four basic operations. They are able to apply whole number addition 
and subtraction skills to one-step word problems and money situations. In multiplication, they can find the product of a two-digit 
and a one-digit number. They can also compare information from graphs and charts and are developing an ability to analyze 
simple logical relations. 

LEVEL 200: Beginning Skills and Understandings 

Students at this level have considerable understanding of two-digit numbers. They can add two-digit numbers but are still 
developing an ability to regroup in subtraction. They know some basic multiplication and division facts, recognize relations 
among coins, can read information from charts and graphs, and use simple measurement instruments. They are developing 
some reasoning skills. 

LEVEL 150: Simple Arithmetic Facts 

Students at this level know some basic addition and subtraction facts, and most can add two-digit numbers without regrouping. 
They recognize simple situations in which addition and subtraction apply. They also are developing rudimentary classification skills. 






Figure 2-6 shows the percentage of students reaching 
each performance level by age and assessment year. The 
following sections discuss the data for each age group. 

Nine-year-olds. Trends in the percentage of 9-year-olds
attaining mathematics performance levels 150, 200,
and 250 are displayed in the upper panel of figure 2-6.
In each assessment year, nearly all 9-year-olds (at least
97 percent) demonstrated understanding of simple
arithmetic facts associated with level 150. In 2004, this
percentage was 99, measurably higher by one percentage
point than in 1986, and higher by three points²
than in 1978, with no measurable change since 1990.
The beginning skills and understandings characteristic
of level 200 were demonstrated by 89 percent of
9-year-olds in 2004, higher than in any other assessment
year. In the 2004 assessment, 42 percent of 9-year-olds
performed the numerical operations and beginning
problem solving associated with level 250, a higher
percentage than in any other assessment year. There was an
increase of 11 percentage points for 9-year-olds at this
level between 1999 and 2004. 



2 Detail may not sum to totals because of rounding. Differences between percentages are calculated using unrounded values. In this instance, the result of
the subtraction differs from what would be obtained by subtracting the rounded values shown in the accompanying figure.

Figure 2-6. Trends in percentages at or above mathematics performance levels for students ages 9, 13, and 17: 1978-2004

[Figure not reproduced. Panels: Age 9, Age 13, and Age 17; vertical axis: percent of students at or above each performance level.]

*Significantly different from 2004.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1978-2004 Long-Term Trend Mathematics Assessments. 



Thirteen-year-olds. The percentages of 13-year-old students
scoring at or above mathematics performance
levels 200, 250, and 300 across the assessment years are
displayed in the middle panel of figure 2-6. Since 1986,
99 percent of 13-year-olds demonstrated the beginning
skills and understandings associated with level 200. In
2004, 83 percent scored at or above level 250, demonstrating
the ability to perform numerical operations and
beginning problem solving. Overall gains are also evident
at level 300, where students performed moderately
complex procedures and reasoning. The percentage of
students who scored at or above this level increased
from 18 percent in 1978, to 23 percent in 1999, and to
29 percent in 2004.



Seventeen-year-olds. Trends in the percentage of 17-year-olds
scoring at or above mathematics performance
levels 250, 300, and 350 are displayed in the last
panel of figure 2-6. Since 1986, at least 96 percent
of 17-year-olds have performed at or above level 250,
demonstrating the ability to perform numerical operations
and beginning problem solving. The percentage
of 17-year-olds who performed moderately complex
procedures and reasoning (level 300) showed no measurable
change from 1990 to 2004, but has increased by
7 percentage points from 1978. No measurable change
between 2004 and any previous assessment year was
detected at level 350, the highest performance level, in
which students applied a range of reasoning skills to
solve multistep problems. Across the assessment years,
between 5 and 8 percent of students performed at this
level.






Summary 

The results presented in this chapter give an overall
view of national trends in reading and mathematics
achievement. Average scores for the nation, scores for
students in five different ranges of the performance
distribution, and attainment of specific performance levels
were discussed. Looking across the 33 years, upward
trends are most noticeable at age 9 in both reading and
mathematics. Also of interest is the increase in
performance at age 13 in mathematics.

The following figures provide an overview of the
major findings presented in this chapter by comparing
students’ performance in 2004 to that of their counterparts
in the first year data were collected. In addition,
2004 and 1999 results are compared, providing a summary
of trends over the last five years.

Arrows pointing upward (↑) indicate improvement,
and horizontal arrows (↔) indicate no measurable
change in performance. For example, the first line of
the display in figure 2-7 indicates that the national
average reading score for 9-year-olds was higher in
2004 than it was in 1971 or 1999.



Figure 2-7. Summary of trends in reading and mathematics
average scale scores for students ages 9, 13,
and 17: 1971-2004

Reading

↑ 9-year-olds’ average scale scores since 1971 (↑ since 1999)
↑ 13-year-olds’ average scale scores since 1971 (↔ since 1999)
↔ 17-year-olds’ average scale scores since 1971 (↔ since 1999)

Mathematics

↑ 9-year-olds’ average scale scores since 1973 (↑ since 1999)
↑ 13-year-olds’ average scale scores since 1973 (↑ since 1999)
↔ 17-year-olds’ average scale scores since 1973 (↔ since 1999)

↑ Significantly higher in 2004.
↔ Indicates no significant difference between earlier year and 2004.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), selected 
years, 1971-2004 Long-Term Trend Reading and Mathematics Assessments. 



Figure 2-8. Summary of trends in reading and mathematics
scale score percentiles for students ages 9, 13,
and 17: 1971-2004

Reading

9-year-olds
^ 10th percentile since 1971 ( ^ since 1999)
^ 25th percentile since 1971 ( ^ since 1999)
^ 50th percentile since 1971 ( ^ since 1999)
^ 75th percentile since 1971 ( ^ since 1999)
^ 90th percentile since 1971 ( * since 1999)

13-year-olds
10th percentile since 1971 ( * since 1999)
^ 25th percentile since 1971 ( since 1999)
50th percentile since 1971 ( + since 1999)
^ 75th percentile since 1971 ( ^ since 1999)
^ 90th percentile since 1971 ( since 1999)

17-year-olds
10th percentile since 1971 ( ^ since 1999)
25th percentile since 1971 ( * since 1999)
50th percentile since 1971 ( since 1999)
* 75th percentile since 1971 ( ^ since 1999)
90th percentile since 1971 ( ^ since 1999)

Mathematics

9-year-olds
^ 10th percentile since 1978 ( ^ since 1999)
^ 25th percentile since 1978 ( ^ since 1999)
^ 50th percentile since 1978 ( ^ since 1999)
^ 75th percentile since 1978 ( ^ since 1999)
^ 90th percentile since 1978 ( ^ since 1999)

13-year-olds
^ 10th percentile since 1978 ( since 1999)
^ 25th percentile since 1978 ( ^ since 1999)
^ 50th percentile since 1978 ( ^ since 1999)
^ 75th percentile since 1978 ( ^ since 1999)
^ 90th percentile since 1978 ( ^ since 1999)

17-year-olds
^ 10th percentile since 1978 ( ^ since 1999)
^ 25th percentile since 1978 ( since 1999)
^ 50th percentile since 1978 ( * since 1999)
* 75th percentile since 1978 ( ^ since 1999)
90th percentile since 1978 ( ^ since 1999)



^ Significantly higher in 2004. 

Indicates no significant difference between earlier year and 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), selected 
years, 1971-2004 Long-Term Trend Reading and Mathematics Assessments. 






Figure 2-9. Summary of trends in reading and mathematics percentages at or above
performance levels for students ages 9, 13, and 17: 1971-2004

Reading

9-year-olds
^ Level 150 (simple, discrete reading tasks) since 1971 ( ^ since 1999)
^ Level 200 (partially developed skills and understanding) since 1971 ( ^ since 1999)
^ Level 250 (interrelate ideas and make generalizations) since 1971 ( ^ since 1999)

13-year-olds
Level 200 (partially developed skills and understanding) since 1971 ( since 1999)
* Level 250 (interrelate ideas and make generalizations) since 1971 ( ^ since 1999)
^ Level 300 (understand complicated information) since 1971 ( ^ since 1999)

17-year-olds
* Level 250 (interrelate ideas and make generalizations) since 1971 ( ^ since 1999)
* Level 300 (understand complicated information) since 1971 ( ^ since 1999)
^ Level 350 (learn from specialized reading materials) since 1971 ( since 1999)

Mathematics

9-year-olds
^ Level 150 (simple arithmetic facts) since 1978 ( ^ since 1999)
^ Level 200 (beginning skills and understandings) since 1978 ( ^ since 1999)
^ Level 250 (numerical operations and beginning problem solving) since 1978 ( ^ since 1999)

13-year-olds
^ Level 200 (beginning skills and understandings) since 1978 ( ^ since 1999)
^ Level 250 (numerical operations and beginning problem solving) since 1978 ( ^ since 1999)
^ Level 300 (moderately complex procedures and reasoning) since 1978 ( ^ since 1999)

17-year-olds
^ Level 250 (numerical operations and beginning problem solving) since 1978 ( ^ since 1999)
^ Level 300 (moderately complex procedures and reasoning) since 1978 ( since 1999)
* Level 350 (multistep problem solving and algebra) since 1978 ( ^ since 1999)



^ Significantly higher in 2004. 

Indicates no significant difference between earlier year and 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of 
Educational Progress (NAEP), selected years, 1971-2004 Long-Term Trend Reading and Mathematics Assessments. 






These long-term trend results
show some progress since 1999
towards reducing the White-Black
and White-Hispanic score
gaps at age 9.



Chapter 3 

Trends in Academic Achievement 
Among Student Groups 

A key goal of the NAEP long-term trend assessment is to monitor the 
progress of various groups of students to determine whether any change 
in national scores is occurring across all student groups or is limited to a 
particular group. It is important to examine the performance gaps between 
student groups and any changes in these gaps over time as well as the overall
achievement of all students. The assessment results presented in this
chapter provide one source of information useful in monitoring progress of 
student achievement in this country. 



Some of the student groups measured by this assessment are defined by 
gender, race/ethnicity, parental education level, and type of school (public 
or nonpublic). However, this report provides data only on those groups 
with sufficient sample size to produce reliable results. For instance, only 
White, Black, and Hispanic racial/ethnic groups are described here, as the 
sample sizes for Asian/Pacific Islander and American Indian/Alaska Native 
students were too small to provide reliable estimates. See tables B-1 and
B-2 in the appendix for information on the percentage distribution of
participating students by racial/ethnic group.

The NAEP long-term trend assessment has examined public and nonpublic
school students’ performance separately since 1980 in reading and
1978 in mathematics. However, in this report, results for nonpublic schools
are neither displayed nor discussed because the participation rates for
nonpublic schools were too low to produce valid and reliable results (see the
School and the Student Sampling sections of appendix A for more detail). 
NAEP is preparing a report on the performance of nonpublic (private) 
school students with trend results from the main NAEP assessments (Perie, 
Vanneman, and Goldstein [forthcoming]). 

The performance of students in each of these student groups is described 
in this chapter. First, descriptions of the student groups are given, and 
then the results for reading are displayed, followed by mathematics. Line 
graphs are used to display the average reading and mathematics scale scores 
attained by students in each group across the assessment years. Where 
appropriate, gaps between the student groups are also presented. For 
instance, the charts highlight any differences in scores of male and female 
students as well as the average score gaps between Black and White students
and Hispanic and White students. The average score of each student
group and age (9-, 13-, and 17-year-olds) is placed on a 0-500 scale in
both subject areas to provide a numeric summary of students’ performance. 



28 



CHAPTER 



3 



Description of Student Groups 

Results from the long-term trend assessment are presented
in this chapter for gender, race/ethnicity, and
highest level of parents’ education. The following sections
describe how the data were collected on each of
the student groups discussed in this chapter, and give
relevant background information about group membership
and achievement.

Gender 

In years past, gender differences have received considerable
attention. Male students traditionally scored higher
on average than female students in mathematics and
science, while females scored higher on average than
males in reading and writing (Baker and Jones 1993;
Bauer, Park, and Sullivan 1998; Freeman 2004; Mullis
et al. 1998). Now, gender differences are less pronounced
in the United States than in other countries.
For instance, in a recent international assessment of 
13-year-olds, no differences were found in the United 
States between male and female students’ scores in 
mathematics, but there were gender gaps in reading in 
which females scored higher than males in the United 
States (Lemke et al. 2002). So, although much of the 
nation’s attention has shifted to the performance gaps 
between different racial/ethnic groups, it is important 
to continue to examine the trends in the male-female 
score gap. 

The roster of sampled students from each participating
school identifies the students as either male or
female. These data are used to examine trends in male 
and female students’ average reading and mathematics 
scores, which are presented in this chapter. 



Race/Ethnicity 

Previous main NAEP reports have shown a consistent 
finding of White and Asian students outperforming 
their Black and Hispanic peers. (See, for example, 
Braswell et al. 2005; Donahue, Daane, and Jin 2005.) 
Reducing the performance gaps between racial/ethnic 
groups is a primary goal of the recent federal legislation 
in education (NCLB 2002). 

Although data are collected on five mutually
exclusive racial/ethnic groups, the performance of
only three groups is reported in this section: White,
Black, and Hispanic students. The other racial/ethnic
groups, Asian/Pacific Islander and American Indian/Alaska
Native, are not reported, as the samples collected
were of insufficient size to analyze and report
separately. Data for Hispanic students were not available
in 1971, so the trend in reading scores for this group
runs from 1975 through 2004.

Relatively small numeric changes in scores are more 
likely to be statistically significant for White students 
than for Black or Hispanic students, because the 
weighted samples of White students tended to be larger 
than weighted samples for other racial/ethnic groups, 
with a corresponding lower margin of error. That is, 
the standard errors associated with larger groups, such 
as White students, are smaller than the standard errors 
associated with smaller groups, such as Hispanic students.
Therefore, a similar difference between years in
scale scores is more likely to be statistically significant 
for the larger group than for the smaller group. 
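The effect of standard errors on significance can be sketched in Python. The example below assumes a simple z test for the difference between two independent estimates with a critical value of 1.96; that test, the function names, and all the numbers are assumptions made for illustration, and the report's own testing procedure may differ.

```python
import math


def z_statistic(diff, se_a, se_b):
    """z statistic for a difference between two independent estimates."""
    return diff / math.sqrt(se_a ** 2 + se_b ** 2)


def is_significant(diff, se_a, se_b, critical=1.96):
    """Two-sided test at roughly the .05 level (assumed threshold)."""
    return abs(z_statistic(diff, se_a, se_b)) > critical


# The same hypothetical 4-point change between two assessment years.
score_change = 4.0

# Hypothetical standard errors: small for a large group, large for a small group.
larger_group = is_significant(score_change, se_a=0.9, se_b=1.0)
smaller_group = is_significant(score_change, se_a=2.4, se_b=2.6)

print(larger_group, smaller_group)
```

Under these assumed values the 4-point change counts as significant for the larger group but not for the smaller one, even though the numeric change is identical.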



NAEP 2004 TRENDS IN ACADEMIC PROGRESS 



29 



Parents’ Highest Level of Education 

Parental education may influence student performance 
in school in a variety of ways. Earlier NAEP reports 
have shown that across all ages and subject areas, students
who reported higher parental education levels
tended to have higher assessment scores, on average. 
(See, for example, Braswell et al. 2005; Donahue, 
Daane, and Jin 2005.) 

In the long-term trend assessment background questionnaires,
students at all three ages are asked to identify
the highest level of education attained by their parents. 
The student indicates how far each parent went in 
school, choosing from the following categories: did not 
finish high school, graduated from high school, went 
to another school after high school, graduated from 
college, and I don’t know. The highest education level 
of either parent is used in these analyses. Data go back 
to 1978 in mathematics and 1980 in reading. In 1971 
and 1975, students were asked to choose their parents’ 
highest education level from among fewer categories. 
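The rule that the highest education level of either parent is used can be sketched as follows. The category order follows the list above; the treatment of "I don't know" (ignored when the other parent's level is known) is an assumption made for this illustration, since the text does not say how such responses are combined, and the function name is hypothetical.

```python
# Response categories in ascending order, as listed in the questionnaire text.
ORDER = [
    "did not finish high school",
    "graduated from high school",
    "went to another school after high school",
    "graduated from college",
]
UNKNOWN = "I don't know"


def highest_parent_education(parent_a, parent_b):
    """Return the higher of two reported levels.

    Assumption: an "I don't know" response is skipped when the other
    parent's level is known; if neither is known, UNKNOWN is returned.
    """
    known = [r for r in (parent_a, parent_b) if r in ORDER]
    if not known:
        return UNKNOWN
    return max(known, key=ORDER.index)


print(highest_parent_education("graduated from high school",
                               "graduated from college"))
```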

For purposes of this section, only the results from 1978 
forward will be discussed so that “some education after 
high school” and “college graduate” can be analyzed 
separately. It should be noted that 9-year-olds’ reports 
of their parents’ education level may not be as reliable as 
those of older students and are therefore not reported. 

Trends in Reading Scores 
by Student Groups 

This section presents the results of the long-term trend
reading assessment for each type of student group.
For gender and race/ethnicity, first the results are presented
for each student group, and then the score gaps
between the groups are examined. 



Trends in Reading Scores by Gender 

Trends in reading scores for both male and female students
are shown in figure 3-1. Among male students,
9-year-olds had a higher average score in 2004 than in
any previous assessment year. Thirteen-year-old males’
average reading score in 2004 was higher than the
scores in 1971 and 1975 but not measurably different
from the scores in all other assessment years. In 2004,
the average score of male 17-year-olds showed no measurable
difference from 1971 or 1999.

The reading trends of female students are similar 
to those of male students. At both ages 9 and 13 the 
female students’ average reading scores were significantly
higher in 2004 than in 1971. At age 9, the average
score for female students was higher in 2004 than in
any previous assessment year except 1980. There were
no measurable differences in average scores for 13-year-old
female students between 1975 and 2004. At age 17,
female students’ average score in 2004 was lower than 
those in 1990 and 1992 but not measurably different 
from that in 1971. 

Score Differences Between Male and Female Students 

Figure 3-1 also displays the gap between the male and 
female average scores. All reading score differences 
show female students scored higher on average than 
their male counterparts in 2004. The gender gap at age 
9 decreased from 13 score points in 1971 to 5 score 
points in 2004. In contrast, there has been no measurable
change in the score gap at age 13 between 2004
and any previous assessment year. For 17-year-olds, the
score gap in 2004 was larger than the gaps in 1980 and
1988, but showed no measurable difference from the
gaps in other assessment years. 



Figure 3-1. Trends in average reading scale scores and score gaps for students ages 9, 13, and 17, by gender: 1971-2004

[Figure not reproduced. Panels: Age 9, Age 13, and Age 17; vertical axis: scale score; series: Female, Score gap, and Male.]

*Significantly different from 2004.

1 Male average scale score minus female average scale score.

NOTE: Score gaps are calculated based on differences between unrounded average scale scores. Negative numbers indicate that the average scale score for male students was lower 
than the score for female students. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 



How to interpret this graphic . . . 

Graphics such as those in figures 3-1, 3-2, and 3-3 are
called “gap charts.” They are intended to show both
the trend in performance of a single student group over
time (such as female students) and the gap between two
groups of students (such as males and females). In figure
3-1, the average reading scores of male and female students
are graphed separately, and the difference between
the two scores is shown. For example, in 2004, female
9-year-olds had an average score of 221, and male
9-year-olds had an average score of 216. When the average
score for female students is subtracted from the average
score of male students, the difference is -5 points. All
differences are shaded. 
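The subtraction described above can be written out as a small Python sketch. The two averages are the rounded 2004 values quoted in the text; the function name `score_gap` is an assumption made for the example, and the report itself computes gaps from unrounded averages.

```python
def score_gap(male_avg, female_avg):
    """Male average minus female average; negative means males scored lower."""
    return male_avg - female_avg


# Rounded 2004 reading averages for 9-year-olds, as quoted in the text.
reading_gap_age9_2004 = score_gap(male_avg=216, female_avg=221)
print(reading_gap_age9_2004)  # -5
```

The sign convention matters: because the gap is defined as male minus female, a negative value indicates that female students had the higher average.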






Trends in Reading Scores by 
Race/Ethnicity 

Figures 3-2 and 3-3 display the average reading scores 
across assessment years for White, Black, and Hispanic 
students as well as the score gaps between White and 
Black or White and Hispanic students. 

Trends in Reading for White Students 

For White students, the average scores for 9- and 13-year-olds
were higher in 2004 than in 1971. As with
the national sample, scores for White 9-year-olds were 
higher in 2004 than in any previous assessment year. 

Trends in Reading for Black Students 

For Black students at all three ages, average reading
scores in 2004 were higher than in 1971. At age 9,
Black students scored higher on average in 2004 than
in any previous administration year, up 30 points
from 1971 and up 15 points¹ since 1999. For age 13,
scores increased by 22 points between 1971 and 2004.
Average scores for Black students at age 17 increased
between 1971 and 2004 by 25 points.

Score Differences Between White and Black Students 

As shown in figure 3-2, the differences in scores for 
White and Black students have decreased between the 
first (1971) and the most recent (2004) assessments 
across all three ages, although White students scored 
higher on average than Black students at each age level 
in 2004. 

The score gap between Black and White students at 
age 9 decreased by 18 points between 1971 and 2004 
and by 9 points between 1999 and 2004. At age 13, 
the gap decreased from 39 points in 1971 to 22 points 
in 2004. At age 17, the gap decreased by 24 points 
between 1971 and 2004. 



1 Detail may not sum to totals because of rounding. Differences between scores are calculated using unrounded values. In this instance, the result of the
subtraction differs from what would be obtained by subtracting the rounded values shown in the accompanying figure.

Figure 3-2. Trends in average reading scale scores and score gaps for White students and Black students ages 9, 13, and 17:
1971-2004

[Figure not reproduced. Panels: Age 9, Age 13, and Age 17; vertical axis: scale score; series: White, Score gap, and Black.]

*Significantly different from 2004.

1 White average scale score minus Black average scale score.

NOTE: Score gaps are calculated based on differences between unrounded average scale scores. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 






Trends in Reading for Hispanic Students 

The average reading scores for Hispanic students show
mixed results across the ages. As with the other
racial/ethnic groups, the average reading score for Hispanic
students at age 9 was higher in 2004 than in any other
assessment year. The average score for Hispanic students
at age 13 shows an increase of 10 points between
1975 and 2004. The scores for 17-year-old Hispanic
students increased by 11 points² between 1975 and
2004, but no measurable changes were seen between
1999 and 2004. It is worth noting that due to smaller
sample sizes, the standard errors associated with the
scores of Hispanic students are relatively large, meaning
that differences that look large may not be statistically
significant.



Score Differences Between White and Hispanic Students 

As shown in figure 3-3, White students scored higher 
on average than their Hispanic peers in reading at each 
age in 2004. 

At age 9, the score gap between White and Hispanic 
students decreased from 34 points in 1975 to 21 points 
in 2004. At age 13, any apparent changes between 
2004 and all previous assessment years in the size of 
the score gap were not statistically significant, except 
between 2004 and 1994, when the score gap narrowed 
by 6 points. At age 17, the score gap between White 
and Hispanic students was measurably smaller in 2004 
than in 1975. 



2 Detail may not sum to totals because of rounding. Differences between scores are calculated using unrounded values. In this instance, the result of the
subtraction differs from what would be obtained by subtracting the rounded values shown in the accompanying figure.

Figure 3-3. Trends in average reading scale scores and score gaps for White students and Hispanic students ages 9, 13, and 17:
1971-2004

[Age 9 and Age 17 panels not reproduced. Vertical axis: scale score; series: White, Score gap, and Hispanic.]

Age 13

Year    White    Hispanic 2      Score gap 1
1971    261*     not reported    not reported
1975    262*     232*            30
1980    264      237             27
1984    263*     240             23
1988    261*     240             21
1990    262*     238             24
1992    266      239             27
1994    265      235*            30*
1996    266      238             28
1999    267      244             23
2004    266      242             24

*Significantly different from 2004.

1 White average scale score minus Hispanic average scale score.

2 Data for Hispanic students are included in the overall national results but not reported as a separate racial/ethnic category in 1971.

NOTE: Score gaps are calculated based on differences between unrounded average scale scores. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1971-2004 Long-Term Trend Reading Assessments. 






Trends in Reading Scores by Parents' Highest 
Level of Education 

The average reading scores of students at ages 13 and 
17 by students’ reports of parents’ highest education 
level across the assessment years are shown in figure 
3-4. Results are not reported at age 9, because internal 
research shows that students’ reports of their parents’ 
education level are less reliable at this age. The percentage
of students reporting that at least one parent had
graduated from college has increased since 1980, while
the percentages of students reporting that the highest
level of education for their parents was a high school
diploma or less have decreased (see table B-2).

Among 13-year-olds, there were no measurable differences
in average scores between 2004 and all previous
assessment years regardless of student-reported level of
parental education. In 2004, scores averaged 251, 264,
and 270, respectively, for students who reported that at
least one parent graduated from high school, completed
some education after high school, or graduated from
college. None of these average scores was measurably
different from the average scores in 1999 or 1980.

At age 17, there were no measurable differences in 
average scores in 2004 compared to average scores in 
1980 and 1999 for three of the four student-reported 
levels of parents’ education. The exception was for students
who reported that at least one parent had some
education after high school. At age 17, the average 
score for students who indicated their parents had some 
education after high school was lower in 2004 than in 
any previous assessment year, dropping from 295 to 
286 between 1999 and 2004. 



Figure 3-4. Trends in average reading scale scores for students ages 13 and 17, by student-reported parents’ highest level
of education: 1980-2004

[Figure not reproduced. Panels: Age 13 and Age 17; vertical axis: scale score.]

*Significantly different from 2004.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years, 
1980-2004 Long-Term Trend Reading Assessments. 






Trends in Mathematics Scores 
by Student Groups 

This section presents the results of the long-term trend 
mathematics assessment for the various student groups. 
For gender and race/ethnicity, the results are presented 
first for each group separately and then the score gaps 
between the groups are examined. 



Trends in Mathematics Scores by Gender

As discussed in chapter 2, the mathematics national
trend showed higher average scores in 2004 than in previous
assessment years for ages 9 and 13, while at age
17 there were no measurable changes in average scores
between 2004 and 1973 or 1999. For the most part, the
scores of male and female students paralleled that trend,
as seen in figure 3-5.

For male students, the average mathematics scores at
ages 9 and 13 were higher in 2004 than in any previous
assessment year. Scores for males at age 9 increased
by 25 points between 1973 and 2004 and by 10 points
between 1999 and 2004. The average score for male
students at age 13 was higher in 2004 than in 1999 by
5 points.³ The average score for male students at age
17 was higher in 2004 than in 1978, but there was no
measurable difference between the scores in 1999 and
2004.

The trends for female students were similar, as average
scores in 2004 were higher than in any previous
assessment year at ages 9 and 13. At age 13, there was a
3-point increase in the average scores of female students
between 1999 and 2004. At age 17, female students
scored higher in 2004 than in 1973 but showed no
measurable difference between the scores in 1999 and
2004.



Score Differences Between Male and Female Students 

Figure 3-5 also shows the gap between the average 
mathematics scores of males and females. At age 9, 
the apparent difference between male and female students
in 2004 was not statistically significant, while the
change in the score gap between 1973 and 2004 was 
statistically significant. Males had higher average scores 
than females at ages 13 and 17. The gender score gaps 
for 13- and 17-year-olds were measurably different 
between 1973 and 2004. 



How to interpret this graphic . . . 

Graphics such as those in figures 3-5 through 3-7 are called
“gap charts.” They are intended to show both the trend
in performance of a single student group over time (such
as female students) and the gap between two groups of
students (such as males and females). In figure 3-5, the
average mathematics scores of male and female students
are graphed separately, and the difference between the
two scores is shown. For example, in 2004, female
9-year-olds had an average score of 240, and male
9-year-olds had an average score of 243. When the average
score for female students is subtracted from the average
score of male students, the difference is 3 points, shown
with the dotted line. All differences are shaded.



3 Detail may not sum to totals because of rounding. Differences between scores are calculated using unrounded values. In this instance, the result of the
subtraction differs from what would be obtained by subtracting the rounded values shown in the accompanying figure.
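
The effect described in this note can be illustrated with a short sketch. The unrounded scores below are hypothetical values chosen only for illustration; they are not NAEP estimates. The sketch shows how a difference computed from unrounded averages and then rounded can differ from the difference between two averages that were rounded first.

```python
# Hypothetical unrounded average scale scores (illustrative only, not NAEP data).
score_1999 = 277.6
score_2004 = 282.3

# Difference computed from the unrounded values, then rounded for reporting.
gap_unrounded = round(score_2004 - score_1999)

# Difference obtained by subtracting the rounded values shown in a figure.
gap_from_rounded = round(score_2004) - round(score_1999)

print(gap_unrounded, gap_from_rounded)  # prints "5 4"
```

With these assumed inputs the two approaches differ by one point, which is the kind of discrepancy the note warns about.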






Figure 3-5. Trends in average mathematics scale scores and score gaps for students ages 9, 13, and 17, by gender: 1973-2004

[Figure: separate panels for ages 9, 13, and 17, each showing scale scores by assessment year for male and female students and the score gap (1) between them.]

# The estimate rounds to zero.

* Significantly different from 2004.

1 Male average scale score minus female average scale score. Negative numbers indicate that the average scale score for male students was lower than the score for female students.
NOTE: Dashed lines represent extrapolated data. Score gaps are calculated based on differences between unrounded average scale scores.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years,
1973-2004 Long-Term Trend Mathematics Assessments.



Trends in Mathematics Scores by Race/Ethnicity

In 2004, the mathematics scores of the three largest
racial/ethnic groups, as measured by the NAEP
long-term trend assessment, show increases in performance
at all ages. Oftentimes, these changes seem different
from the overall trends. These differences are due to
changes in the demographics in the population. Figure
3-6 displays the average scores and score gaps across
assessment years in mathematics for White and Black
9-, 13-, and 17-year-old students.
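
One way to see how demographic change can make group trends look different from the overall trend is a weighted-average sketch. All shares and scores below are hypothetical and chosen only for illustration; they are not NAEP estimates, and the group names are placeholders.

```python
# Hypothetical population shares (percent) and average scores for two groups
# in an earlier and a later assessment year. Illustrative only, not NAEP data.
earlier = {"group_a": (80, 250), "group_b": (20, 220)}
later = {"group_a": (60, 260), "group_b": (40, 230)}


def overall_average(groups):
    """Average score across groups, weighted by each group's population share."""
    total_share = sum(share for share, _ in groups.values())
    return sum(share * score for share, score in groups.values()) / total_share


# Gain within each group, and gain in the overall (weighted) average.
group_gains = {name: later[name][1] - earlier[name][1] for name in earlier}
overall_gain = overall_average(later) - overall_average(earlier)

print(group_gains)   # each group gains 10 points
print(overall_gain)  # the overall average gains 4.0 points
```

In this sketch every group improves by the same amount, yet the overall average rises by less because the mix of students shifts toward the lower-scoring group.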

Trends in Mathematics for White Students 

The average score of 247 in 2004 for White students
at age 9 was higher than in any previous assessment
year. At age 13, White students had an average score
of 288 in 2004, which was higher than in any previous
assessment year. Average scores for White 17-year-olds
showed no measurable difference between 1999 and
2004. However, their average score of 313 in 2004 was
higher than the average score in 1973.

Trends in Mathematics for Black Students 

The average scores for Black students were higher in
2004 than in 1973 at all three ages. The scores for
Black 9-year-olds showed measurable increases between
2004 and any previous assessment year. The score in
2004 was 34 points higher than the score in 1973 and
13 points higher than that in 1999. The 2004
mathematics score for Black 13-year-olds was higher than in
any previous assessment year, and an 11-point increase
in scores occurred between 1999 and 2004. The average
score for Black 17-year-olds in 2004 was higher
than the average score in 1973, but not measurably
different from the average score in 1999.

Score Differences Between White and Black Students 

As seen in figure 3-6, the differences in average scores
for White and Black students at all ages decreased
between the first (1973) and the most recent (2004)
assessments in mathematics, although White students
continued to outperform Black students in 2004.

At age 9, the gap decreased from 33 points in 1973 
to 23 points in 2004. At age 13, the gap decreased 
from 46 points in 1973 to 27 points in 2004, while 
the apparent difference in the gaps between 1999 and 
2004 was not statistically significant. At age 17, the gap 
decreased from 40 points in 1973 to 28 points in 2004. 
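
The narrowing at each age can be tallied from the gaps reported above. This is a minimal sketch using the rounded gaps given in the text; because the report calculates gaps from unrounded scores, the results should be read as approximate.

```python
# White-Black score gaps in points, (1973, 2004), as reported in the text above.
gaps = {9: (33, 23), 13: (46, 27), 17: (40, 28)}

# Narrowing of the gap at each age, based on the rounded gaps.
narrowing = {age: first - latest for age, (first, latest) in gaps.items()}

for age, points in narrowing.items():
    print(f"Age {age}: gap narrowed by about {points} points")
```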



Figure 3-6. Trends in average mathematics scale scores and score gaps for White students and Black students ages 9, 13, and 17:
1973-2004

[Figure: separate panels for ages 9, 13, and 17, each showing scale scores by assessment year for White and Black students and the score gap (1) between them.]

* Significantly different from 2004.

1 White average scale score minus Black average scale score.

NOTE: Dashed lines represent extrapolated data. Score gaps are calculated based on differences between unrounded average scale scores.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years,
1973-2004 Long-Term Trend Mathematics Assessments.






Trends in Mathematics for Hispanic Students 

Figure 3-7 shows the trend lines for White and
Hispanic students from 1973 to 2004. Hispanic
students’ average scores in mathematics were higher at all
three ages in 2004 than in 1973. At age 9, the average
score for Hispanics in 2004 was 28 points higher
than the score in 1973 and higher than in any previous
assessment year. At age 13, the average score in 2004
was higher than in any previous assessment year. At age
17, there was no measurable difference in average scores
for Hispanic students between 1999 and 2004.



Score Differences Between White and Hispanic Students 

As shown in figure 3-7, there were few changes in the 
score gap between White and Hispanic students. White 
students outscored Hispanic students at all three ages 
in 2004. 

At age 9, the 2004 score gap between White and
Hispanic students was measurably narrower than the
gap in 1999, but showed no measurable difference
from the gap in 1973. At age 13, the score gap in 2004
was narrower than the gaps in 1973 and 1978, but not
measurably different from the gaps in any other
assessment year. At age 17, the White-Hispanic score gap was
smaller in 2004 than in 1973, but it was not measurably
different from 1999 or any other assessment year
after 1973.



Figure 3-7. Trends in average mathematics scale scores and score gaps for White students and Hispanic students ages 9, 13, and
17: 1973-2004

[Figure: separate panels for ages 9, 13, and 17, each showing scale scores by assessment year for White and Hispanic students and the score gap (1) between them.]

* Significantly different from 2004.

1 White average scale score minus Hispanic average scale score.

NOTE: Dashed lines represent extrapolated data. Score gaps are calculated based on differences between unrounded average scale scores.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years,
1973-2004 Long-Term Trend Mathematics Assessments.






Trends in Mathematics Scores by Parents' Highest 
Level of Education 

Average mathematics scores for students at ages 13 and
17 by highest level of parents’ education as reported
by the student are shown in figure 3-8. Results are not
reported at age 9, because studies have shown that
students’ reports of their parents’ education level are less
reliable at this age.

At age 13, for students who reported that at least
one parent had graduated from high school, had some
education after high school, or had graduated from
college, the average scores in 2004 were higher than in any
other assessment year. Students who reported that their
parents had less than a high school education had an
average score in 2004 that was higher than the average
score in 1978, but was not measurably different from
the average score in 1999.



The average mathematics scores for 17-year-olds 
showed no measurable changes between 2004 and any 
previous assessment year for students who reported that 
at least one parent had graduated from high school or 
had some education after high school. For students 
with at least one parent who graduated from college, 
the average score of 17-year-olds was about the same in 
2004 as in 1999 and in 1978 with an average score of 
317. Students who reported that their parents had less
than a high school education were the only group
to show improvement between 1978 and 2004.



Figure 3-8. Trends in average mathematics scale scores for students ages 13 and 17, by student-reported parents’ highest level
of education: 1978-2004

[Figure: separate panels for ages 13 and 17, each showing scale scores by assessment year for four groups: graduated from college, some education after high school, graduated from high school, and less than high school.]

* Significantly different from 2004.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), selected years,
1978-2004 Long-Term Trend Mathematics Assessments.






Summary 

This chapter presented results from the NAEP reading
and mathematics long-term trend assessments for
students in different reporting groups. The reporting
groups examined were gender, race/ethnicity, and level
of parental education.

The following figures, 3-9 through 3-11, provide an
overview of the major findings presented in this chapter.
In each line of the display, the average score for a
particular group of students in 2004 is compared to
that in the first assessment year in which data are
available, and to that in 1999. Arrows pointing upward (↑)
indicate increases, horizontal arrows (↔) indicate no
measurable change, and arrows pointing downward
(↓) indicate decreases. For example, the first line of the
display in figure 3-9 indicates that the average reading
score for male 9-year-olds in 2004 was higher than in
both 1999 and 1971.



Figure 3-9. Summary of trends in reading and mathematics 
average scale scores for students ages 9, 13, 
and 17, by gender: 1971-2004 



Reading 



Male

^ 9-year-olds’ average scale scores since 1971 ( ^ since 1999)
^ 13-year-olds’ average scale scores since 1971 ( ^ since 1999)
17-year-olds’ average scale scores since 1971 ( since 1999)
Female

^ 9-year-olds’ average scale scores since 1971 ( ^ since 1999)
^ 13-year-olds’ average scale scores since 1971 ( ^ since 1999)
17-year-olds’ average scale scores since 1971 ( ^ since 1999)



Mathematics 



Male 

^ 9-year-olds’ average scale scores since 1973 ( ^ since 1999) 
^ 13-year-olds’ average scale scores since 1973 ( ^ since 1999) 
17-year-olds’ average scale scores since 1973 ( since 1999) 
Female 

^ 9-year-olds’ average scale scores since 1973 ( ^ since 1999) 
^ 13-year-olds’ average scale scores since 1973 ( ^ since 1999) 
^ 17-year-olds’ average scale scores since 1973 ( * since 1999) 



^Significantly higher in 2004. 

■^Indicates no significant difference between earlier year and 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), selected 
years, 1971-2004 Long-Term Trend Reading and Mathematics Assessments. 






Figure 3-10. Summary of trends in reading and mathematics average scale scores for students ages 9, 13, and 17, by race/ethnicity: 1971-2004



Reading 

White 

^ 9-year-olds’ average scale scores since 1971 ( ^ since 1999) 

^ 13-year-olds' average scale scores since 1971 ( * since 1999) 

* 17-year-olds’ average scale scores since 1971 ( * since 1999) 

Black 

^ 9-year-olds' average scale scores since 1971 ( ^ since 1999) 

^ 13-year-olds’ average scale scores since 1971 ( * since 1999) 

^ 17-year-olds’ average scale scores since 1971 ( * since 1999) 

Hispanic 

^ 9-year-olds’ average scale scores since 1975 ( ^ since 1999) 

^ 13-year-olds’ average scale scores since 1975 ( * since 1999) 

^ 17-year-olds' average scale scores since 1975 ( * since 1999) 

Mathematics 

White 

^ 9-year-olds’ average scale scores since 1973 ( ^ since 1999) 

^ 13-year-olds' average scale scores since 1973 ( ^ since 1999) 

^ 17-year-olds' average scale scores since 1973 ( * since 1999) 

Black 

^ 9-year-olds' average scale scores since 1973 ( ^ since 1999) 

^ 13-year-olds' average scale scores since 1973 ( ^ since 1999) 

^ 17-year-olds’ average scale scores since 1973 ( * since 1999) 

Hispanic 

^ 9-year-olds' average scale scores since 1973 ( ^ since 1999) 

^ 13-year-olds' average scale scores since 1973 ( ^ since 1999) 

^ 17-year-olds’ average scale scores since 1973 ( * since 1999) 

^Significantly higher in 2004. 

■^Indicates no significant difference between earlier year and 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), selected 
years, 1971-2004 Long-Term Trend Reading and Mathematics Assessments. 



Figure 3-11. Summary of trends in reading and mathematics average scale scores for students ages 13 and 17, by student-reported parents’ highest level of education: 1978-2004



Reading 

Less than high school 

13-year-olds’ average scale scores since 1980 ( * since 1999) 

* 17-year-olds’ average scale scores since 1980 ( * since 1999) 

Graduated from high school 

* 13-year-olds’ average scale scores since 1980 ( * since 1999) 

17-year-olds’ average scale scores since 1980 ( * since 1999) 

Some education after high school 

13-year-olds’ average scale scores since 1980 ( * since 1999) 

* 17-year-olds’ average scale scores since 1980 ( ♦ since 1999) 
Graduated from college 

* 13-year-olds’ average scale scores since 1980 ( * since 1999) 

17-year-olds’ average scale scores since 1980 ( * since 1999) 

Mathematics 
Less than high school 

^ 13-year-olds’ average scale scores since 1978 ( * since 1999) 

^ 17-year-olds’ average scale scores since 1978 ( * since 1999) 

Graduated from high school 

^ 13-year-olds’ average scale scores since 1978 ( ^ since 1999) 
17-year-olds’ average scale scores since 1978 ( * since 1999) 
Some education after high school 

^ 13-year-olds’ average scale scores since 1978 ( ^ since 1999) 

* 17-year-olds’ average scale scores since 1978 ( * since 1999) 
Graduated from college 

^ 13-year-olds’ average scale scores since 1978 ( ^ since 1999) 

^ 17-year-olds’ average scale scores since 1978 ( * since 1999) 

^Significantly higher in 2004. 

■^Indicates no significant difference between earlier year and 2004. 

♦Significantly lower in 2004. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), selected 
years, 1978-2004 Long-Term Trend Reading and Mathematics Assessments. 



Students’ home and learning
environments have changed
over time. Students have greater
access to computers and are
taking more upper-level
mathematics classes. Students are also
reading more in school and for
homework.






Chapter 4 

Trends in Students’ School and Home 
Experiences 



In examining trends in students’ academic achievement, it is important also 
to consider the context of their learning. The context of learning today has 
changed since the assessment was first administered in the early 1970s. For 
example, computer technology plays a greater role in education as schools 
improve their infrastructure, use multimedia in their classrooms, and 
encourage students to explore research topics on the Internet. Calculators 
are used more often in the classroom, and algebra is being taught in earlier 
grades than it was three decades ago (Braswell et al. 2005). 



Home environments have changed as well. Contextual variables such as 
availability of computers in the home or parental involvement may affect 
student learning (Cai, Moyer, and Wang 1997; Downes and Reddacliff 
1997; Rathburn, West, and Hausken 2003). As part of NAEP’s long-term 
trend assessments, students have responded to a variety of questions about 
their school and home experiences. The information gained from these 
responses provides insight into the activities and experiences that form the 
contexts in which students learn. This chapter highlights students’ responses 
to NAEP background questions about several key factors associated with 
student achievement. 



In the following sections, data are presented to show each variable’s
relationship to scores on the 2004 NAEP reading and mathematics long-term
trend assessment. Different background questions were asked for reading
and for mathematics, so the two sections highlight different variables.
Trends associated with contextual factors are presented two ways. First, the
relationship between the variable and the average NAEP score is examined.
It should be noted, however, that a relationship between NAEP scores and
students’ responses to certain questions does not establish a causal
relationship between a particular factor and student achievement. The relationship
may be influenced by a number of other variables not accounted for in this
report, such as family income or students’ attitudes. In addition, the
information examined here is based solely on student self-reports, which may
vary in accuracy across ages and students.

Second, the contextual variable is shown on its own to clarify how
students’ responses to the background questions have changed over time. That
is, the percentages of students selecting each response option in 2004 are
compared with those from the first assessment year in which the question
was asked. (The comparison year varies by question.) These comparisons,
even without the associated performance scores, demonstrate how the
context of education has changed over time.






Contextual Factors Associated With 
Reading 

Students responded to several questions relating to 
reading as they took the long-term trend assessment. 
This chapter reports on three variables associated with 
reading: the amount of time spent on homework, the 
number of pages read per day for both school and 
homework, and the amount of time spent reading for fun. 

Amount of Homework 

The first of two background questions pertaining to 
homework on the reading assessment is discussed in 
this section. Specifically, the question relating to time 
spent on homework asked, “How much time did you 
spend on homework yesterday?” The possible responses 
included the following: 

► No homework was assigned. 

► I had homework but didn’t do it. 

► Less than 1 hour 

► 1 to 2 hours 

► More than 2 hours 

This question was asked at age 9 in assessment years
1984 through 2004 and at ages 13 and 17 in
assessment years 1980 through 2004. Figure 4-1 shows the
average reading scores in 2004 by the amount of time
spent on homework for all three age groups, and
figure 4-2 shows the trend in the percentages of students
across the three age groups reporting they spent varying
amounts of time on homework.



Figure 4-1. Average reading scale scores for students ages
9, 13, and 17, by amount of time spent on
homework: 2004

[Figure: horizontal bar charts for ages 9, 13, and 17. Vertical axis: time spent on homework (did not have homework, did not do homework, less than 1 hour, 1 to 2 hours, more than 2 hours). Horizontal axis: scale score.]

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 2004
Long-Term Trend Reading Assessment.

How to interpret this graphic . . .

The graphics in this chapter differ from those in previous chapters in that the scale scores have been placed on the
horizontal axis rather than on the vertical axis. The categories of the contextual variable analyzed are on the
vertical axis. Thus, in figure 4-1, the five categories of “time spent on homework” are shown in order of amounts of time
on the vertical axis, with the horizontal bar showing the average score for each category. For example, at age 17,
students who did not have any homework had an average score of 270, and the average scores increased with each
category of homework, up to 304 for the “more than two hours” category.






At all three ages, less than one hour was the most
commonly reported amount of time spent on homework
the previous day (figure 4-2). However, the
relationship between the amount of time spent on
homework and average score on the NAEP reading
assessment differed across the ages. In 2004, the average
score of 9-year-olds who spent less than one hour on
homework was higher than the average scores of
students who did not do the assigned homework or who
spent more than two hours on homework. The
relationship between homework and achievement was more
straightforward at age 13. In 2004, the average scores
for 13-year-olds who spent either one to two hours
or more than two hours on homework were higher
than the average scores for their peers who spent less
than one hour on homework, did not do their homework,
or did not have any homework to do. At age 17,
higher average scores on the long-term trend reading
assessment were associated with more time spent on
homework. That is, in 2004, those students who spent
more than two hours on homework had higher average
scores than those who spent one to two hours, whose
scores were higher in turn than those of students who
spent less than one hour, whose scores were higher than
those of students who did not do any homework.



How to interpret this graphic . . .

The other type of graphic used in this chapter is a
percentage distribution bar. Figure 4-2 shows the
percentage of students who chose each category of a
question, and the percentages add up to 100 percent
of the assessed students. The years shown include the
first years the question was asked (1980 and 1984),
1999, and 2004. So, figure 4-2 shows that at age
9 the percentage of students who reported that they
spent less than one hour on homework was 41 percent
in 1984 and 53 percent in 1999, both of which
were lower than the 59 percent reported in 2004. At
the same time, the percentage of students who reported
they did not have any homework decreased from
35 percent in 1984 to 21 percent in 2004.



Figure 4-2. Percentages of students ages 9, 13, and 17, by
amount of time spent on homework: 1980, 1984,
1999, and 2004

[Figure: percentage distribution bars for ages 9, 13, and 17, by year and time spent on homework (did not have homework, did not do homework, less than 1 hour, 1 to 2 hours, more than 2 hours).]

* Significantly different from 2004.

NOTE: Detail may not sum to totals because of rounding.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 1980, 1984,
1999, and 2004 Long-Term Trend Reading Assessments.

















In 2004, a greater percentage of 9-year-olds indicated 
that they spent less than one hour on homework than 
in any other year in which the question was asked. 
Simultaneously, the percentage of students indicating 
either that no homework was assigned or that they did 
not do any homework decreased between 1984 and 
2004. The percentage of 13-year-old students spending 
less than one hour on homework has increased, from 
32 percent in 1980 to 40 percent in 2004. At the same 
time, the percentage of students reporting that they did 
not have any homework decreased from 30 percent in 
1980 to 20 percent in 2004. At age 17, the percentage 
of students indicating they spent less than one hour on 
homework the previous day increased from 24 to 28 
percent between 1980 and 2004. At the same time, the 
percentage of 17-year-olds reporting that they were not 
assigned homework decreased from 32 to 26 percent. 
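
The shifts described above can be summarized as changes in percentage points. The sketch below uses only the percentages quoted in the text; the dictionary layout and category labels are illustrative choices, not NAEP variable names.

```python
# Percentages reported in the text: (first year the question was asked, 2004).
# The first year is 1984 at age 9 and 1980 at ages 13 and 17.
homework = {
    (9, "less than 1 hour"): (41, 59),
    (9, "no homework assigned"): (35, 21),
    (13, "less than 1 hour"): (32, 40),
    (13, "no homework assigned"): (30, 20),
    (17, "less than 1 hour"): (24, 28),
    (17, "no homework assigned"): (32, 26),
}

# Change in percentage points between the first year and 2004.
changes = {key: latest - first for key, (first, latest) in homework.items()}

for (age, category), points in changes.items():
    print(f"Age {age}, {category}: {points:+d} percentage points")
```

A change in percentage points is a simple difference between two percentages. It does not by itself indicate statistical significance, which the report assesses separately.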

Pages Read Per Day 

As part of the reading background questionnaire,
students at all three ages were asked about the number of
pages they read in school and for homework each day. 
The response options included the following: 

► 5 or fewer 

► 6 to 10 

► 11 to 15 

► 16 to 20 

► More than 20 

This question was first presented to students at ages 
9, 13, and 17 in 1984. Figure 4-3 shows the average 
reading scores in 2004 by the number of pages read per 
day for all three ages, and figure 4-4 shows the trend 
in the percentage of students reporting reading various 
numbers of pages per day across the three ages. 



Figure 4-3. Average reading scale scores for students ages 9,
13, and 17, by pages read per day in school and
for homework: 2004

[Figure: horizontal bar charts for ages 9, 13, and 17. Vertical axis: pages read per day. Horizontal axis: scale score.]

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 2004
Long-Term Trend Reading Assessment.







In 2004, at ages 9, 13, and 17, students who indicated
that they read 5 or fewer pages a day had lower reading
scores than students in any other category; however, for
students at ages 9 and 13, there were no measurable
differences in the average reading scores among students
who read at least 6 pages a day. That is, students who
indicated that they read more than 20 pages a day did not
have reading scores that were measurably different from
those of students who indicated they read 6-10, 11-15, or
16-20 pages per day. At age 17, there is a more linear
relationship between the number of pages read per day and
average reading scores. For example, students who read more
than 20 pages a day had higher average reading scores
than students who read 11-15, 6-10, or 5 or fewer
pages a day. Students who selected any one of the four
options indicating they read at least 6 pages a day had
higher average scores than students who read 5 or fewer
pages.

At age 9, the trend over the past 20 years has shown
an increase in the number of pages students read for
school and homework. Specifically, fewer students
indicated that they read 5 or fewer pages in 2004 than in
1984. Likewise, the percentage of students indicating
that they read more than 20 pages a day increased from
13 percent in 1984 to 25 percent in 2004. Similarly, a
greater percentage of students at age 13 indicated that
they read at least 16 pages per day in 2004 than in
1984. The percentage of 13-year-olds indicating they
read either 5 or fewer pages or 6-10 pages decreased
between 1984 and 2004. At age 17, there were no
measurable changes in the percentage of students indicating
various numbers of pages read per day over the 20-year
period. In 1984, 1999, and 2004, between 21 and 23
percent of 17-year-olds indicated that they read more
than 20 pages per day, and another 21 to 23 percent
said they read 5 or fewer pages per day.



Figure 4-4. Percentages of students ages 9, 13, and 17, by
pages read per day in school and for homework:
1984, 1999, and 2004

[Figure: percentage distribution bars for ages 9, 13, and 17, by year and pages read per day (5 or fewer, 6 to 10, 11 to 15, 16 to 20, more than 20).]

* Significantly different from 2004.

NOTE: Detail may not sum to totals because of rounding.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 1984, 1999,
and 2004 Long-Term Trend Reading Assessments.















Reading for Fun 

Students at all three age levels were asked, “How often
do you . . . read for fun on your own time?” The possible
responses included the following: 

► Almost every day 

► Once or twice a week 

► Once or twice a month 

► A few times a year 

► Never or hardly ever 

Responses are available for reporting from 1984 
through 2004 at all three ages. Figure 4-5 shows the 
relationship between the amount of time spent reading 
for fun and average reading scores. 



At all three ages, students who indicated that they 
read for fun almost every day had higher average scores 
in 2004 than those who said that they never or hardly 
ever read for fun. Students at all three ages who said 
that they read for fun once or twice a week also had 
higher average scores than those who never or hardly 
ever read for fun. At ages 13 and 17, those who read 
for fun almost every day had higher average scores than 
those who read for fun once or twice a week. 

As seen in figure 4-6, at age 9 the only category
showing a measurable change during this period was
an increase in the percentage of students who indicated
that they read a few times a year, up from 3 percent
in 1984 to 5 percent in 2004. At ages 13 and 17, the
percentage saying they read for fun almost every day
was lower in 2004 than in 1984. This trend accompanied
an increase over the same 20-year time period in
the percentage indicating that they never or hardly ever
read for fun.






Figure 4-5. Average reading scale scores for students ages 9,
13, and 17, by frequency of reading for fun: 2004

[Figure: horizontal bar charts for ages 9, 13, and 17. Vertical axis: frequency of reading for fun (almost every day, once or twice a week, once or twice a month, a few times a year, never or hardly ever). Horizontal axis: scale score.]

‡ Reporting standards not met. Sample size is insufficient to permit a reliable estimate.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 2004
Long-Term Trend Reading Assessment.

Figure 4-6. Percentages of students ages 9, 13, and 17, by
frequency of reading for fun: 1984, 1999, 2004

[Figure: percentage distribution bars for ages 9, 13, and 17, by year (1984, 1999, and 2004) and frequency of reading for fun (almost every day, once or twice a week, once or twice a month, a few times a year, never or hardly ever).]

* Significantly different from 2004.

NOTE: Detail may not sum to totals because of rounding.

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for
Education Statistics, National Assessment of Educational Progress (NAEP), 1984, 1999, and
2004 Long-Term Trend Reading Assessments.
















Contextual Factors Associated With 
Mathematics 

Students responded to several background questions 
relating to mathematics as they took the long-term 
trend assessment. This section reports on four types 
of factors associated with mathematics: course-taking 
patterns, availability of and amount of time spent on 
computers in mathematics studies, frequency of home- 
work, and television-watching patterns. Each of these 
factors is analyzed to determine how it relates to per- 
formance in mathematics as measured by the long-term 
trend assessment and how the responses to these ques- 
tions have changed over the past two to three decades. 



Course-Taking Patterns 

Questions on mathematics courses were given to stu- 
dents in the long-term trend background questionnaire 
at ages 13 and 17. At age 13, the question read: “What 
kind of mathematics class are you in this year?” The 
response options were the following: 

► I am not taking mathematics this year. 

► Regular mathematics 

► Pre-algebra 

► Algebra 

► Other 

In 2004, almost all 13-year-olds said that they were 
taking some mathematics course, and only 6 percent 
indicated that they were taking a mathematics class 
other than the ones listed (see figure 4-8). The remain- 
der of the students at age 13 was split almost evenly 
among the choices of regular mathematics, pre-algebra, 
and algebra. 

It was not possible to determine any variation in content
or difficulty of mathematics classes across schools.
As seen in figure 4-7, among the courses listed, more
advanced mathematics courses were associated with
higher scores on the 2004 long-term trend mathematics
assessment. That is, students who were in algebra
scored higher than those in pre-algebra, who in turn scored
higher than those in regular mathematics classes.



Figure 4-7. Average mathematics scale scores for students
age 13, by type of mathematics course: 2004

[Figure: horizontal bar chart for age 13 showing the average scale
score for each type of mathematics course. Bar values are not
recoverable from this copy.]

‡ Reporting standards not met. Sample size is insufficient to permit a reliable estimate.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 2004
Long-Term Trend Mathematics Assessment.

Figure 4-8 shows the trends in mathematics course- 
taking patterns at age 13 from 1986 through 2004. 
Overall, more 13-year-olds are enrolled in algebra, up 
from 16 percent in 1986 to 29 percent in 2004 — a 
higher percentage of students than in any previous 
assessment year. The percentage in pre-algebra has 
also increased from 19 percent in 1986 to 32 percent 
in 2004, while the percentage in regular mathematics 
decreased from 61 percent in 1986 to 33 percent in 
2004. 
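
The changes quoted above can be restated as percentage-point differences. The following is a minimal sketch using only the rounded percentages cited in the text. The dictionary keys and the function name `point_change` are illustrative labels chosen here, not NAEP variable names, and a difference of rounded values says nothing by itself about statistical significance.

```python
# Rounded percentages of 13-year-olds by mathematics course, as quoted
# in the text for 1986 and 2004. Keys are illustrative labels only.
course_pct = {
    "regular mathematics": {1986: 61, 2004: 33},
    "pre-algebra": {1986: 19, 2004: 32},
    "algebra": {1986: 16, 2004: 29},
}


def point_change(shares, start=1986, end=2004):
    """Return the change in percentage points between two years."""
    return shares[end] - shares[start]


for course, shares in course_pct.items():
    print(f"{course}: {point_change(shares):+d} percentage points")
```

Under these figures, regular mathematics fell by 28 percentage points while pre-algebra and algebra each rose by 13.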

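Figure 4-8. Percentage of students age 13, by type of
mathematics course: 1986, 1999, and 2004

[Figure: percentage bars for age 13 by year. Only the first row of
values survives in this copy. Its starred values match the 1986
percentages cited in the text.]

Year   Regular mathematics   Pre-algebra   Algebra   Other   Not taking mathematics
1986   61*                   19*           16*       5       #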
#The estimate rounds to zero. 

*Significantly different from 2004. 

NOTE: Detail may not sum to totals because of rounding. 

SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center 
for Education Statistics, National Assessment of Educational Progress (NAEP), 1986, 1999, 
and 2004 Long-Term Trend Mathematics Assessments. 









At age 17, the question was worded differently to 
focus on all mathematics classes taken. The question 
read: “Counting what you are taking now, have you 
ever taken any of the following mathematics courses?” 
Students indicated that they had or had not taken each 
of the following subjects: 

► General, business, or consumer mathematics 

► Pre-algebra or introduction to algebra 

► First-year algebra 

► Second-year algebra 

► Geometry 

► Trigonometry 

► Pre-calculus or calculus 

The most advanced mathematics class checked by 
the students was recorded as the highest level of math- 
ematics taken. 
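
The recoding rule described above can be sketched as follows. This is an illustration under stated assumptions, not the procedure NAEP used: the ordering in `COURSE_LEVELS` (geometry below second-year algebra, trigonometry between second-year algebra and pre-calculus or calculus) is assumed for the sketch, and the names `COURSE_LEVELS` and `highest_course` are hypothetical.

```python
# Courses from the age-17 question, ordered from least to most advanced.
# ASSUMPTION: this ordering is chosen for illustration; the report does
# not spell out the full hierarchy used in the recoding.
COURSE_LEVELS = [
    "General, business, or consumer mathematics",
    "Pre-algebra or introduction to algebra",
    "First-year algebra",
    "Geometry",
    "Second-year algebra",
    "Trigonometry",
    "Pre-calculus or calculus",
]


def highest_course(taken):
    """Return the most advanced course in `taken`, or None if none is checked."""
    checked = [course for course in COURSE_LEVELS if course in taken]
    return checked[-1] if checked else None


# A student who checked first-year algebra and geometry.
example = {"First-year algebra", "Geometry"}
print(highest_course(example))
```

Keeping the hierarchy in a single ordered list means a different ordering can be tried by editing one place.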

The majority of students at age 17 (53 percent) indicated
that the highest level of mathematics they had
taken was second-year algebra (figure 4-10). Only 4
percent had not yet taken algebra, and 17 percent had
taken calculus. As seen in figure 4-9, the highest level
of mathematics taken was positively associated with
average scores on the 2004 long-term trend assessment.
That is, students who had taken calculus had a higher
average score than those whose highest mathematics
class was second-year algebra. Those who took second-year
algebra had a higher average score than those whose highest
class was geometry, and geometry students outperformed
first-year algebra students. Pre-algebra students had a
lower average score in mathematics than students who
had taken any mathematics course beyond pre-algebra.

How to interpret this graphic . . .

Each variable in this section has two graphics. The
first graphic, such as figure 4-7, shows the different
categories of responses with horizontal bars showing
the average score for each category. The second graphic,
such as figure 4-8, shows the percentage of students
selecting each response category in the first year
the question was asked and in 1999 and 2004. The
percentages should add up to 100 percent of assessed
students but may not be exact due to rounding.

Figure 4-9. Average mathematics scale scores for students
age 17, by highest mathematics course taken:
2004

[Figure: horizontal bar chart for age 17 showing the average scale
score (axis 0 to 500) for each highest mathematics course taken.
Bar values are not recoverable from this copy.]

‡ Reporting standards not met. Sample size is insufficient to permit a reliable estimate.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center
for Education Statistics, National Assessment of Educational Progress (NAEP), 2004
Long-Term Trend Mathematics Assessment.



